Credit where credit is due


A proposed author ID system is gaining widespread support, and could help lay the foundation for an 
academic-reward system less heavily tied to publications and citations. 


In his classic book Management Teams, UK psychologist Meredith
Belbin used extensive empirical evidence to argue that effective 
teams require members who can cover nine key roles. These roles 
range from the creative ‘plants’ who generate novel ideas, to the disci- 
plined ‘implementers’ who turn plans into action and the big-picture 
‘coordinators’ who keep everyone working together. 

Much the same range of roles is critical for science. Unfortunately, 
the academic system tends to reward only some of those activities 
— notably those that have easily measured outcomes, such as the 
publication and citation numbers so heavily weighted by promotion 
and tenure committees. 

On page 843, Nature profiles a research group composed largely 
of what Belbin would call ‘completer finishers’ — perfectionists who 
are driven to fix all the flaws, fill all the gaps and get the job finished 
correctly. This particular group is trying to complete the reference 
human genome sequence, which is still full of errors nearly a dec- 
ade after the first draft was announced in 2000. It is essential work: 
modern sequencing techniques still use the reference to anchor new 
data even as they grind out genomes at a fraction of the cost of the 
original. But it is also work that offers few academic rewards beyond 
the satisfaction of a job well done — it is unlikely to result in a high- 
profile publication. 

Such unsung contributions to science may soon be easier to evalu- 
ate and quantify through an author ID system proposed earlier this 
month and backed by 23 organizations, including Thomson Reuters, 
Nature Publishing Group, Elsevier, ProQuest, Springer, CrossRef, 
the British Library and the Wellcome Trust. The Open Researcher 
and Contributor ID (ORCID) would be an alphanumeric string that 
uniquely identifies an individual scientist in much the same way that 
a Digital Object Identifier uniquely identifies a paper, book or other 
scholarly publication (more details and the complete list of partici- 
pants will soon be available at www.orcid.org). The system would 
distinguish between the world’s multitudinous Dr Smiths and Profes- 
sor Wangs, but would not be affected by name changes, cultural dif- 
ferences in name order, inconsistent first-name abbreviations or the 
use of different alphabets. It would be attached to researchers’ journal
publications, and could also be assigned to data sets they helped to
generate, comments on their colleagues’ blog posts or unpublished 
draft papers, edits of Wikipedia entries and much else besides. 

This kind of ‘microattribution’ could ultimately make it possible
for each researcher to have a constantly updated ‘digital curriculum 
vitae’ providing a picture of his or her contributions to science going 
far beyond the simple publication list. 

ORCID is hardly the first proposal for an author ID system. A 
number of publishers have been exploring the idea, and the Interna- 
tional Organization for Standardization in Geneva is developing an 
international standard name identifier to track contributors to media 
content such as books, television pro- 
grammes and newspaper articles. But 
most of those developers have already 
joined or are working closely with the 
ORCID group. Moreover, the intention 
is to make ORCID freely available for 
anyone to use, and interoperable with 
existing ID systems. 

The next step is for the ORCID 
group to turn the concept into a working system. That is scheduled 
to happen over the next six months, with the software being based 
on Thomson Reuters’ existing ResearcherID system. In parallel, the 
group will be setting up an independent organization to run the sys- 
tem and assign ORCIDs to individual researchers. 

There will be many challenges along the way — not least of which 
is establishing rigorous protocols for validating and authenticating 
ORCID assignments. No one wants to see the system abused by indi- 
viduals seeking to pad their academic credentials. 

But perhaps the largest challenge will be cultural. Whether ORCID 
or some other author ID system becomes the accepted standard, the 
new metrics made possible will need to be taken seriously by every- 
one involved in the academic-reward system — funding agencies, 
university administrations, and promotion and tenure committees. 
Every role in science should be recognized and rewarded, not just 
those that produce high-profile publications.


Mind the gap 


It will take time to assess the value of fresh 
approaches to science and technology studies. 


The relationship between the social sciences and the natural
sciences has historically been fraught. ‘Hard’ scientists have
often treated the social sciences with disdain. For example,
some of them fought, successfully at first, to exclude the social
sciences from the remit of the US National Science Foundation. And
those social scientists who studied science itself, under the remit of 
science and technology studies, often returned the favour, seeming 
on occasion to be devoting themselves myopically to demonstrating 
that the scientific emperor had few, if any, clothes. 

There remains something of a dialogue of the deaf between these 
two wings of the academy, separated as they are by language, custom 
and methodology. But barriers are coming down. Senior scientists 
and administrators, especially those in socially contentious areas 
such as climate change and reproductive technologies, realize that 
they need to collaborate with scholars of society-at-large. Sociologists
and philosophers of science, in turn, are acquiring a more intimate 
understanding of the scientists that they study. 

These promising developments are being driven by a wider politi- 
cal context. In the United States, the events of 11 September 2001 
and the subsequent wars in Afghanistan and Iraq led to a reassess- 
ment of the role of social-sciences research, particularly in regard to 
its relevance to national security. This led to firmer, bipartisan sup- 
port for the social, behavioural and economic sciences directorate 
at the National Science Foundation. In Europe, meanwhile, strong 
public suspicion of new technologies — which has had particularly 
devastating consequences for the deployment of genetically modified 
crops in Europe and beyond — has encouraged governments to set 
aside more resources for the early involvement of social scientists in 
technology development. 

In this issue, we report on the strengths and weaknesses of a UK 
initiative to bring the social sciences to bear more effectively on 
genomics, and on the life sciences more generally (see page 840). 
This experiment suggests that coherent, multidisciplinary centres 
can help social scientists to get a firmer grip on the complex science, 
cultures and behaviours underlying new technologies. But it also 
highlights the need for funding agencies, such as the UK Economic 
and Social Research Council, to retain a close interest in the strate- 
gic direction of such centres, and to ensure that their successes and 
failures are noted and built upon, even after their direct funding 
has expired. 

The increased involvement of social scientists in science and tech- 
nology issues has been especially pronounced of late in nascent fields 
such as nanotechnology and synthetic biology, where funding agen- 
cies feel that they have to tread carefully lest their work unleashes a 
backlash from the public. 

In these areas, it is too early to assess the value — and benefici- 
aries — of the social scientists’ contribution. The idea of embed- 
ding sociology, law and philosophy firmly in the development of a 
scientific discipline from the outset is only now being tested. There
is optimism among many of the engineers and natural and social
scientists involved.

However, all the signs are that the various parties are approaching 
these collaborations very much on their own terms. Natural scientists 
are under pressure to deliver new insight and applications. As far as 
they are concerned, if social scientists wish to observe them, that’s 
probably tolerable. 

Social scientists, in turn, wish to be respected for their insight 
into how scientists and their ideas function both within their 
communities and, above all, in relation to societal ambitions and 
values. These researchers do not, by 
and large, see their purpose as being to 
pre-empt societal reactions or public 
engagement, or to help natural scien- 
tists communicate. Moreover, for them, 
the tension between collaboration and 
detachment in these projects is real. 

There is a possibility, therefore, 
that these parties will end up walking 
and talking past each other. What is 
more, the management of rigorous 
programmes involving both groups is 
hampered by the difficulty that the social scientists (and those who 
support them) have in reaching agreement on what constitutes outstanding
analysis of human practices in these contexts: just how that can best
be achieved, and to what extent it should be at the service of
government policy goals.

None of this should encourage a dismissive attitude among 
sceptics. The applications of genetics, nanotechnology, synthetic 
biology and other technologies are giving rise to substantial new 
challenges in professional practice and communication, in ethics, in 
intellectual property and in many other dimensions beyond the sci- 
ence itself. Objective insights into these dimensions have their own 
value, and the new collaborations should help. The challenge remains 
to identify how that value can best be fulfilled.




A class of their own 


The Japanese winners of Nature's mentoring awards 
have the universal qualities of outstanding advisers. 


“Dr Kitano is always ready to invest in apparently absurd
ideas. ... He actively seeks to gain international exposure
for his young researchers by making them corresponding
authors on his papers. ... His distinctive mentoring style — decisiveness
and respect for the individual — comes from the fact that he is
not a pure product of the Japanese system.”

“The thing that most surprised me when I joined the Oosawa
group was that everybody called him ‘Oosawa-san’ [rather than
the much more formal ‘Oosawa-sensei’]. Also, rather than sitting
in an office ... he walked around the lab collaring people and talking
with them.”

These two extracts are drawn from the enthusiastic nominations
of Hiroaki Kitano, head of Sony Computer Science Laboratories
in Tokyo, and Fumio Oosawa, a biophysicist at Aichi Institute of 
Technology in Toyota — the respective winners of the 2009 ‘mid 
career’ and ‘lifetime achievement’ awards given by Nature for scien- 
tific mentoring. Since the awards’ inception in 2005, they have been 
held in a different country every year, and they have been judged 
each time by a multidisciplinary panel of leading scientists from that 
country (see go.nature.com/Rccbo4). 

An account of this year’s awards, which took place in Japan, can 
be found on page 948. As in previous years, the two winners display 
accessibility, a broad and insightful overview, and an ability to engage 
with young researchers on the latter’s own terms — qualities that 
seem to be common to outstanding mentors everywhere. There is 
no doubt that the Japanese system tends to be strongly hierarchical, 
but it was clear to the judges that, as in all other countries in which 
the competition has been held, qualities that buck such hierarchies 
lead to outstanding new generations. Congratulations to Oosawa 
and Kitano.


Monkey talk


Proc. Natl Acad. Sci. USA doi:10.1073/pnas.0908118106 
(2009) 

The vocal communication system used by Campbell's 
monkeys may represent the most complex syntax-like 
structure yet found among animals. 

Karim Ouattara and Alban Lemasson of the 
University of Rennes in France and Klaus Zuberbühler
of the University of St Andrews, UK, recorded and 
analysed the calls of males in six groups of free-ranging 
Campbell's monkeys in the rainforest of Ivory Coast. 

The males have just six basic types of call, but 
combine these in context-specific sequences to 
convey different information. Crowned eagles, for 
example, elicited four different sequences, and 
leopards three, according to how the male learnt 
about their presence — by seeing them, hearing them, 
or learning about them through the hearsay of other 
monkey species.


PHYSICAL CHEMISTRY


Dual-aspect particles


Phys. Rev. Lett. 103, 237801 (2009)
Like their mythological namesake, Janus
particles have two faces: one that attracts
and one that repels a liquid. Scientists are
interested in the nanoparticles’ behaviour
because they mimic that of many biological
and chemical molecules.

When suspended in solution, the repellent faces
cluster together, causing the particles to clump.
Francesco Sciortino of the University of Rome
La Sapienza and his colleagues have now found
that this clumping affects gas-to-liquid phase
transitions of the particles. Simulations of up to
5,000 particles in solution show that the clumping
creates unusual behaviour: contrary to expectation,
the gas phase (pictured above) is more ordered
than the liquid phase (pictured right) and the
material expands as it cools.

The researchers believe that their simulations
could prompt new experimental work with Janus
particles.


ECOLOGY


Reef regulation


Proc. R. Soc. B doi:10.1098/rspb.2009.2062 (2009)
An ecosystem's stability is postulated to
increase as its number of species goes
up, owing to the increased number of
interactions between those species and the
subsequent robustness of the ecological
network.

Carl Simpson and Wolfgang Kiessling
of the Berlin Museum of Natural History
propose an explanation for this relationship
on evolutionary timescales. They say the
‘diversity-stability’ relationship can be
explained solely by the extinction of species:
high species turnover needs to be buffered by
higher species numbers.

If this is true, then the diversity-stability
relationship should be strongest when
the extinction rate is high. Looking at
historical coral reef data, they found that the
relationship was historically strong during
periods of high extinction, and weak during
low-extinction periods.


CANCER BIOLOGY


Tumours hate company


Proc. Natl Acad. Sci. USA doi:10.1073/pnas.0910753106 (2009)
Keeping normally gregarious
rats isolated from their own
kind boosts their cancer
risk, according to
Martha McClintock
and her colleagues at
the University of Chicago
in Illinois.

They kept 20 rats alone and 20
rats in groups of five. All 40 were
genetically prone to mammary cancer.
The lone rats exhibited a 135% increase in
the number of tumours, an 8,391% increase
in the size of tumours and a 3.3-fold increase
in the relative risk of malignancy compared
with those kept in groups.

Isolated rats were also more stressed,
anxious, fearful and vigilant. The authors
suggest that prolonged exposure to large
pulses of the stress-related hormone
corticosterone may have contributed to
tumour origin and growth.


GEOLOGY


Bubble batholiths


Lithosphere 1, 323-327 (2009)

In some mountains and plateaux, geologists
find granite rocks that formed from
magmas that had risen up through ‘floating’
continental tectonic plates. The rocks’
origin has often been attributed to melting
underneath the continental plates, caused by
convection in the hot mantle.

However, Donna Whitney at the
University of Minnesota in Minneapolis and
her colleagues suggest that subduction —
the downward thrusting of one plate under
another during tectonic collisions — might
be responsible. Using a numerical model, the
researchers find that continental subduction
can lead to melting of crustal slabs and
percolating granitic magma.


POPULATION GENETICS


Asia's common origin


Science 326, 1541-1545 (2009)
Humans migrated from Africa into Asia,
along its southern coast and then down
into Indonesia. But whether this wave also
accounted for east Asian populations or was
supplemented by one or more later migratory
waves along a northern route has been the
subject of debate.

The HUGO Pan-Asian SNP Consortium
reports an analysis of nearly 55,000 variations
in genes from nearly 2,000 people that
supports the single-wave theory. The analysis
finds a high degree of overlap between
the genomes of all southeast Asians and
east Asians, lesser genetic similarity with
caucasian populations, and a decreasing
genetic diversity from southern to northern
China, suggesting that humans entered Asia
in a single primary migratory wave.


CHEMISTRY 


One-hit wonder 


Nature Chem. doi:10.1038/nchem.477 (2009) 
Ammonia provides the nitrogen for most 
synthetic chemicals. But its industrial 
synthesis from nitrogen gas relies on high 
temperatures and pressures, and gobbles 
up fossil fuel. 

Building on previous work on splitting 
the strong nitrogen-nitrogen triple bond, 
Paul Chirik and his colleagues at Cornell 
University in Ithaca, New York, have now 
coaxed nitrogen to react with another 
abundant gas, carbon monoxide, in one 
room-temperature step. The reaction forms 
carbon-carbon and carbon-nitrogen bonds, 
the backbones of many useful chemicals, and 
is orchestrated by a compound containing 
the rare metal hafnium. This compound is 
not catalytic, so is unlikely to find widespread 
use. But the reaction’s nitrogen-weakening 
mechanism may inform new ways to assemble 
complex molecules from simple gases. 


PALAEONTOLOGY 


Dawn of the anomodonts 


Proc. R. Soc. B doi:10.1098/rspb.2009.0883 (2009) 
The anomodonts were mammal-like reptiles 
that were widespread from 270 million years 
ago until at least 200 million years ago. A 
new specimen of an animal called Biseridens
qilianicus has recently been unearthed in
Gansu, China. The specimen is in such good
shape (pictured, below) that Jun Liu of the
Chinese Academy of Sciences in Beijing
and his colleagues were able to confirm
an earlier hunch that this animal is a very
early anomodont. In fact, it is the most
basal anomodont yet found, meaning that
it is a member of the oldest branch on the
anomodont family tree.

This analysis supports the idea that 
anomodonts originated on the old northern 
continent of Laurasia rather than on its 
southern counterpart, Gondwana, as 
previously thought. 


PSYCHOLOGY 


Personality versus mood 


Arch. Gen. Psychiatry 66, 1322-1330 (2009) 
The antidepressant paroxetine doesn't 
just make people happier, it alters their 
personality as well. 

Tony Tang at Northwestern University in 
Evanston, Illinois, and his colleagues studied 
changes in neuroticism and extraversion — 
two personality traits linked to depression 
and the neurotransmitter serotonin — in 
240 patients in a 16-week trial with a one- 
year follow-up. Half of the patients received 
paroxetine, one quarter a placebo and one 
quarter cognitive therapy. 

Placebo patients improved their depression 
scores but reported little change in personality. 
By contrast, patients on paroxetine reported
a decrease in neuroticism and an increase
in extraversion, even after the results were
normalized for differences in depression
improvement. Those with the greatest declines
in neuroticism also showed lower relapse rates.
Rather than being a mere by-product of
improved mood, these personality changes
may help explain why drugs such as paroxetine
work against depression in the first place.


EPIDEMIOLOGY 


Malaria's mark 


Science 326, 1546-1549 (2009) 

The deadliest of the four human malaria 
parasites, Plasmodium falciparum, has left its 
imprint on the human genome in the form 
of malaria-protective mutations, including 
those that cause sickle-cell anaemia. 

Now, Lluis Quintana-Murci and Anavaj 
Sakuntabhai at the Pasteur Institute in Paris 
and their colleagues show that — in a similar
trade-off — pressure from a neglected strain, 
P. vivax, may maintain a common enzyme 
deficiency in southeast Asia that can cause 
jaundice and anaemia. 

The team found that the local gene variant 
associated with the enzyme deficiency was also 
associated with a 30-60% reduction in parasite 
density of P. vivax but not P. falciparum. People
with two copies of the gene had the lowest
parasite densities. The results suggest that
P. vivax has had a larger effect on the human
genome than previously thought. 


Correction 

The Research Highlight ‘Rude awakening’ (Nature 
462, 547; 2009) incorrectly described the
green parts of the image. The figure shows an
expression pattern of green fluorescent protein 
(GFP) in fruitfly brains, which overlaps with 
expression of dopamine receptors. 


JOURNAL CLUB 


Reuben Shaw 
The Salk Institute for Biological 
Studies, La Jolla, California 


A cancer researcher ponders 
a fundamental connection 
between nutrients and gene 
expression. 


Nutrient availability to single-celled
organisms varies according
to their environment, and proteins
in the cell that sense nutrient
levels alter gene expression
to increase uptake and use of
specific metabolites to fuel cellular
processes. Conversely, most
cells in multicellular organisms
are exposed to constant nutrient
levels by the bloodstream, and
so far there are few examples of
metabolism being directly coupled
to the control of gene expression.

A recent paper by Craig
Thompson and his colleagues at
the University of Pennsylvania
in Philadelphia uncovers a direct
connection between a well-known
metabolic enzyme — ATP citrate
lyase (ACL) — and changes in
gene expression (K. E. Wellen et al.
Science 324, 1076-1080; 2009).
Through a chain of reactions,
ACL influences the functioning
of the histones, proteins that
package lengths of DNA — and
unpackage them for ‘reading’. This
means that there is a basic — and
surprising — relationship between
cell glucose levels and gene
expression.

We don't yet know how
metabolic challenges — for
example, fasting — in whole
organisms affect ACL levels or
activity. But we do know that some
of the same proteins that increase
tumour growth also modify ACL by
attaching phosphorus.

It is likely that we are just at the
tip of the iceberg in terms of our
understanding of the molecular
basis of how metabolic inputs
dictate gene-expression changes
in mammalian cells. Future studies
using genetic models of ACL
loss in distinct mouse tissues, as
well as chemical inhibitors of the
enzyme, will help to elucidate
in which contexts it is critical
for gene-expression changes in
the whole organism. Moreover,
our knowledge of this metabolic
linchpin may provide a therapeutic
window for the treatment of
certain forms of cancer, almost
all of which undergo metabolic
adaptation.


Discuss this paper at http://blogs.nature.com/nature/journalclub




NEWS BRIEFING 


POLICY


UK space agency: The British 
government has announced 
plans for a national space 
agency. The agency will replace 
the British National Space 
Centre, which was set up by 
the government in 1985 to 
coordinate space research. The 
new agency will consolidate 
the efforts of six government 
departments, three science and 
technology funding bodies
and the Met Office. But it is not
yet clear whether it will have 
independent control of funds for 
space activities. 


Agriculture reforms: The 
Consultative Group on 
International Agricultural 
Research voted for wide- 
reaching organizational changes 
on 8 December. The public- 
private partnership, which 
oversees 15 research centres and 
supports some 8,000 scientists 
and staff across the world, has 
established thematic science 
programmes and set up a trust 
fund to manage US$500 million 
in annual donations. 


Xenotransplant trials: 
Australia has lifted a 5-year
ban on clinical trials in which
animal cells, tissues or organs 
are transplanted into humans. 
The National Health and 
Medical Research Council said 
on 10 December that it was 
satisfied the risks of transmitting 
animal viruses in this way
were low, and that trials could
proceed once regulatory and 
monitoring frameworks have 
been established. The United 
States, Russia, European Union 
and China are among many 
places that already allow human 
xenotransplantation trials, the 
council said. 


Countering bioweapons: The
United States announced on
9 December that it would try to
revitalize the Biological Weapons
Convention (BWC) that bans the
development, production and
stockpiling of such weapons. Ellen
Tauscher, undersecretary of state
for arms control, said that the
country would urge all nations to
sign the BWC, but rejected efforts
to develop a system to verify
that the convention was being
obeyed. The treaty has foundered
since the administration of
former president George W. Bush
broke with attempts to set up a
compliance scheme in 2001 (see
Nature 414, 675; 2001).


UNITED NATIONS CLIMATE CHANGE CONFERENCE 2009


CRUNCH TIME AT COPENHAGEN


After being derailed by protests from developing countries, the United Nations climate talks in Copenhagen are
back to something resembling order. As Nature went to press, government ministers led by conference president
Connie Hedegaard and Yvo de Boer, executive secretary of the United Nations Framework Convention on Climate
Change (both pictured), were preparing a negotiating text for some 130 heads of state. But the architecture of any
possible agreement was still unclear, and a deep divide remained between developed and developing nations.
More than 45,000 delegates, journalists and lobbyists had registered to attend — three times the capacity of
the Bella Center, where the talks are being held. For daily updates, see nature.com/roadtocopenhagen.


Budget threat: British 
scientists were rattled by the 
UK government's 2010 pre- 
budget report on 9 December, 
which called for a £600-million 
(US$975-million) reduction in 
spending on higher education, 
science and research for the 
period 2011-13. Details of the 
cutbacks were not specified
— and the present Labour
government may not win next 
year’s general election — but 
their mention reminded 
researchers that painful spending 
cuts almost certainly lie in wait. 


NUMBER 
CRUNCH 


−19%


The average 
investment return in 
the 2009 financial year 
from more than 500 US 
college and university 
endowments. 


Source: NACUBO-Commonfund 
Study of Endowments 
Drug-safety rap: Five years after
the blockbuster painkiller Vioxx 
was withdrawn after being linked 
to heart attack and stroke, the US 
Food and Drug Administration 
(FDA) is still not taking adequate 
steps to ensure the safety of 
marketed drugs, says a report by 
the Government Accountability 
Office (GAO). The report, made 
public on 9 December, checked 
the agency’s progress in enacting 
recommendations from a 2006 
GAO report. It found that the 
FDA had failed to transfer 
sufficient authority for post- 
market drug safety to the Office 
of Surveillance and Epidemiology 
and away from the office that 
approves new drugs. 


BUSINESS


Geothermal project buried: An 
innovative geothermal project 
that would drill deep into the 
Earth to extract energy from hot 
rocks has been shut down. On
10 December, public authorities
in Basel, Switzerland, closed
a private geothermal drilling
project led by Geopower Basel, 
after a government study found 
that the small earthquakes it 
generated would cause too much 
economic damage. The project 
had been on hold since 2006. See 
page 848 for more. 


Solar-power boost: The World 
Bank announced on 9 December 
that it planned US$5.6 billion 
of financing to speed up the 
deployment of concentrated 
solar power in North Africa 
and the Middle East. Its Clean 
Technology Fund would spend 
$750 million on 11 projects in 
Algeria, Egypt, Jordan, Morocco 
and Tunisia, and $4.85 billion 
would be mobilized from other 
sources, including donors
and commercial debt. The
investment will create around a 
gigawatt of solar capacity over 
the next five years, the World 
Bank says — a sizeable addition 
to the 6-7 gigawatts of global 
projects in the pipeline by the 
end of 2008. 


RESEARCH


Harvard stops building: With 
a hobbled endowment, Harvard 
University in Cambridge, 
Massachusetts, announced on 
10 December that it would halt 
construction of the US$1-billion 
science complex on its Allston 
campus early next year once
the foundations are complete.
This brings to a standstill the
university's ambitious expansion
plans (see Nature 454, 686-689;
2008). Instead, the university
said it would shift focus towards
improving and leasing vacant
Allston properties. It is not clear
when building work might begin
again; the university is reviewing
financing options.


WISE
The Wide-Field
Infrared Survey
Explorer, a space
telescope, was
launched by NASA
on 14 December.


Protein structures removed: 
The University of Alabama
at Birmingham has asked for
twelve entries to be removed
from the Protein Data Bank. This
follows the recommendation of
an expert committee that has
been looking into allegations
of suspect data. The protein
structures are cited in hundreds 
of papers. Last week, The Journal 
of Biological Chemistry retracted 
a paper containing one of the 
structures. 


Marine viewing: A sea-floor 
observatory in the northeast 
Pacific Ocean officially went 
live on 8 December, promising 
free access via the Internet to its 
live camera feeds and to data on 
marine life off British Columbia. 
The Can$100-million
(US$95-million) NEPTUNE
Canada project (http://neptunecanada.ca) will
study biological, physical
and geological processes up
to 300 kilometres offshore,
using instruments and sensors 
connected by a cable that runs 
electricity and fibre-optics in an 
800-kilometre undersea loop. 


Malaria optimism: A 
“tremendous increase in 
funding” for malaria has shown 
positive results, the World 
Health Organization (WHO) 
said on 15 December, as it 
published its World Malaria 
Report 2009. US$1.7 billion was 
committed in 2009, compared 
with $730 million in 2006, 
allowing greater distribution of 
insecticide-treated bed nets and 
artemisinin-based therapies. 
From 2000 to 2008, malaria 
cases have been cut by at least 
50% in more than one-third of 
malarious countries. But the 
WHO says that $5 billion is still 
required annually, and resistance 
to artemisinin remains a threat. 


THE WEEK 
AHEAD 


17-18 DECEMBER 

The governing council of CERN 
meets. Its agenda includes 
possible expansion of the 
organization's international 
membership — Israel and Turkey 
are among those who have 
applied to join. 

go.nature.com/jk7A9T


18 DECEMBER 

The United Nations Climate 
Change Conference in 
Copenhagen comes to an end. 
http://en.cop15.dk


19 DECEMBER 

South Korea's first icebreaker 
research ship, ARAON, is 
scheduled to set sail for the 
Antarctic on a three-month
expedition. 

www.kopri.re.kr/index_eng.jsp


Synchrotron stand-off 
continues: Scientists at the 
Australian Synchrotron in 
Melbourne voted last week to 
again work only between 9 a.m. 
and 5 p.m. in renewed protest 
at the facility's governing board. 
The synchrotron is undergoing 
maintenance over the Christmas 
break, but if the limited-hours
schedule continues in
2010, it will cripple research
projects in a facility usually
booked around-the-clock. Five 
members of the synchrotron’s 
scientific advisory committee 
also resigned last week (see 
Nature 462, 706-707; 2009). 


BUSINESS WATCH

Makers of light-emitting diodes (LEDs) are
preparing to increase capacity in response
to growing demand. LED suppliers were
operating at 30% of capacity earlier this
year, but are now producing at full stretch,
says Emma Ritch, an analyst at business-information
firm Cleantech Group in
San Francisco, California. Among the
companies that have raised hundreds of
millions of dollars this year to expand LED
facilities are Cree, based in Durham, North
Carolina; Seoul Semiconductor, which
supplies firms such as Samsung and LG
with LEDs for televisions; and Epistar in
Hsinchu, Taiwan.

Market forecaster Strategies Unlimited 
in Mountain View, California, says that the 
market for high-brightness LEDs will almost 
triple in five years (see graphic). Vrinda 
Bhandarkar, an analyst there, says demand 
for LEDs in mobile phones has peaked, 
but future hot areas include backlights 
for better-contrast, thinner computer 
and television displays. General lighting 
will follow: LEDs are now bright enough 
to illuminate streets, but still costly (see 
Nature 459, 312-314; 2009). Capacity may 
outpace demand by 2011, says Cleantech, 
when prices could crash and the herd of 
more than 500 LED-makers could thin out. 


GLOWING PROSPECTS FOR LEDS
A growing market is predicted for high-brightness light-emitting diodes.
Total market size: $5.1 billion (2008); $14.9 billion (2013).
Segments shown: signs, automotive, lighting, display backlights, mobile appliances, other*.
*Includes traffic signals, indicators, medical devices
Budget win for climate probe


NASA gets cash to replace a failed carbon-emissions observatory, but concerns remain over future funding. 
The replacement Orbiting Carbon Observatory will monitor Earth's carbon sinks and sources.


The US Congress is ratcheting up demands for 
NASA to launch Earth-monitoring satellites 
that could help to verify the emissions targets 
currently being debated in Copenhagen. 

In a US$447-billion spending bill approved on
13 December (see table), lawmakers told NASA
to spend $50 million in fiscal year 2010 on a
replacement for the Orbiting Carbon Observatory
(OCO), which crashed into the ocean near
Antarctica in February after a rocket failure.
“It looks like there is a future here,” says David
Crisp, the mission's principal investigator at the
Jet Propulsion Laboratory in Pasadena, California.
But by adding the OCO to NASA's already
long list of Earth-science missions — and with 
no promise of future funding — some Earth 
scientists worry that Congress is asking the 
agency to do too much. Berrien Moore, director 
of Climate Central, a think tank in Princeton, 
New Jersey, says that he was both “pleased and 
worried” by the OCO funding because of the 
additional burden on the mission programme. 

By measuring levels of atmospheric carbon 
dioxide, the OCO could provide baseline emis- 
sions data and act as a proof-of-concept that 
carbon sources and sinks can be monitored 
from space. The observatory would measure 
CO₂ changes to a precision of 1 part per million
at a resolution of about 3 square kilometres
— nearly 30 times that of the Japanese Green- 
house gases Observing Satellite (GOSAT, also 
known as IBUKI), which launched in January. 
Replacing the OCO will cost about the same as
the original $280-million mission, says Crisp; 
if funding continues to be granted, the observa- 
tory could be launched as early as 2013. 

That would require a much bigger budget for
fiscal year 2011, but because NASA is one of several
science agencies not included in a targeted
doubling of basic-science funding (see ‘Will the
budget bubble burst?’), it may well face a flat
budget next year. “Or worse,” says Moore.

NASA already has a list of 15 other Earth- 
science missions that were identified in a ‘dec- 
adal survey’ to prioritize missions over the next 
ten years. In the spending bill, Congress said it 
was “concerned” about the limited progress of 
those missions, and gave $15 million to accel- 
erate two that are intended to monitor global 
climate change. It has also instructed NASA to 
look at using commercial providers, following 
the lead of a panel that reviewed the agency’s 
human spaceflight programme and earlier this 
year called for greater reliance on commercial 
rocket companies. 

But Moore doesn't see the Earth-science 
missions being profitable enough for com- 
mercial companies to be interested in running 
them. In lieu of a surprise windfall in Febru- 
ary’s 2011 budget proposals, he says, NASA 
might need to delay the missions further: “We 
may have to rename the decadal programme 
the centennial.”
Eric Hand 
Will the budget
bubble burst? 


US science agencies saw modest budget 
rises for fiscal year 2010 after Congress 
approved a spending bill on 13 December. 
However, researchers are bracing themselves 
for tighter budgets in 2011 as the government 
clamps down on its deficit spending. 

The National Institutes of Health received 
a 2.3% rise in funds for 2010, although some 
observers say that is because the agency is 
still digesting a $10.4-billion one-off infusion 
from the 2009 economic stimulus package. 

NASA received a 5.3% rise, but funds for its 
$4.5-billion science directorate were shaved 
down by $34 million compared with last 
year. Despite small budgetary windfalls for 
specific climate-monitoring satellites (see 
main story), Earth scientists at NASA are 
worried about the long-term demands on 
their budget. 

Basic-science agencies — the National 
Science Foundation, the National Institute 
of Standards and Technology and the energy 
department's Office of Science — are still 
on course to double their budgets within a 
decade, as dictated by the 2007 America 
COMPETES Act. 

However, as the economy recovers and the 
government shifts its focus from stimulus 
spending to deficit cutting, some agencies 
are planning for flat or falling budgets in 
fiscal year 2011. “It's going to be a tough 
year,” says Patrick Clemins, director of 
the research and development budget 
and policy programme at the American 
Association for the Advancement of Science 


in Washington DC. E.H.

BUDGET GROWTH FOR 2010

Agency                                            Budget (per cent increase on 2009)
National Institutes of Health                     $31 billion (2.3%)
NASA                                              $18.7 billion (5.3%)
Department of Energy, Office of Science           $4.9 billion (2.7%)
National Science Foundation                       $6.9 billion (6.7%)
National Institute of Standards and Technology    $857 million (4.6%)

SOURCE: US HOUSE COMMITTEE ON APPROPRIATIONS 


TISSUE PROFILING 

NMR technology may 

help surgeons to make the 
kindest cut. 
go.nature.com/QM4s6p 


Royal Institution faces cash crisis 


The Royal Institution of Great Britain, once 
home to historical figures such as Michael 
Faraday and Lawrence Bragg, has survived 
since 1799 and is the world’s oldest scientific 
research organization. But it now faces a 
financial crisis that could bring its 200-year 
reign to an end. 

The institute offers a rare blend of 
research and outreach, says Richard Catlow, 
who headed its Davy Faraday Research 
Laboratory from 1998 to 2007. In the United 
Kingdom, it is well known for its annual 
Christmas lectures, a series of high-profile 
lectures aimed at the general public that are 
televised nationwide. 

But it depends on fundraising and 
membership for money, and has faced 
financial difficulties in the past. In 2004 
the institution ran up a deficit of £400,000 
(US$650,000), according to a 2005 
financial statement filed with the Charities 
Commission, which regulates charities in 
England and Wales. In 2006, its director, 


Susan Greenfield, a University of Oxford 
neuroscientist known for her high media 
profile, began a £22-million refurbishment 
of its headquarters in central London, 
intended to make it a more attractive venue 
to hire out for conferences and public events. 
To help pay for the work, the research staff 
was cut from 60 to just 15, drawing criticism 
from some scientists (see 
Nature 453, 568-569; 2008). 

The project also ran behind 
schedule and over budget. 
Fundraising was hampered 
by the recession, and the 
institution was forced to dip 
into its endowment and other 
‘restricted funds’. By September 2008, it
had spent £3.2 million designated for other 
activities, including the Christmas lecture 
programme, according to the latest financial 
statement to the Charities Commission, 
dated 29 September 2009. 

Last week, The Guardian newspaper
reported that Greenfield was being asked to
take a pay cut — and reduce the scope of her 
role — to help make up for the shortfall. The 
institution declined to comment to Nature, 
saying only that “discussions about the role 
of the director of the Royal Institution are 
currently taking place between the board of 
trustees and the current director”. 

Chris Rofe, a former 
administrator at London's 
Science Museum and the 
Millennium Dome, was 
brought in this April to 
oversee fundraising and 
financial accountability. The 
Charities Commission audit 
acknowledged that a plan was in place to 
see the organization through to late 2011 
and gradually repay the money spent from 
restricted funds. But it added: “By their very 
nature, there is a significant uncertainty as to 
whether these projections will be achieved.”
Geoff Brumfiel 


SNAPSHOT 
China celebrates 
panda genome 


With just 1,600 giant pandas estimated to remain 
in the wild, Chinese scientists have led the task 
of immortalizing the charismatic critter's 2.25 
billion base pairs of DNA, reporting their findings 
online in Nature last week. Although it is unlikely 
to have a significant effect on conservation, the 
work is a proof-of-principle for next-generation 
sequencing technologies, and allows China to 
trumpet work involving a national animal. Indeed, 
one tactic for researchers hoping to win funding 
may be to sequence similarly patriotic symbols. 
“Australia has the most interesting animals in 
the world,” says Jenny Graves, a geneticist at 

the Australian National University in Canberra 
and deputy director of the Australian Research 
Council's Centre for Kangaroo Genomics, who 
analysed sequences from the first marsupial 

(a South American opossum, ironically) and 

the duck-billed platypus. Graves says that such 
efforts are not just gimmicks; the kangaroo 
genomics project has helped researchers to work 
out that the SRY gene determines sex in humans 
and other mammals (J. W. Foster et al. Nature 
359, 531-533; 1992). Other patriotic sequencing 
projects are detailed in the table. 

Brendan Borrell 


Country                    Organism                          Status
China                      Giant panda                       Draft assembly in 2009
Australia                  Tammar wallaby                    Whole-genome map in 2008
United States (Hawaii)     Transgenic papaya                 Draft assembly in 2008
France and Italy           Wine grape (Pinot Noir strain)    Draft assembly in 2007
China and United States    Rice                              Draft assembly in 2002
Sweden                     Norway (European) spruce          Recently announced
Satellites beam in biomass estimates


Additional detail could help bring woodland into a future climate treaty. 


Whatever agreement emerges from the climate 
meeting in Copenhagen, many expect that 
it will include a mechanism allowing rich 
nations to offset their emissions by paying 
poorer countries to protect their forests — and 
the carbon they contain. But just how much 
carbon is at stake? Researchers at the meet- 
ing have given their best answer yet: the first 
satellite-based estimates of the biomass con- 
tained in the world’s tropical forests. 

Current biomass estimates for the tropics are 
based on data gathered by the Food and Agricul- 
ture Organization of the United Nations (FAO), 
and their quality varies greatly from country to 
country. As a result, baseline figures for biomass 
are some of the biggest uncertainties in calculat- 
ing emissions from deforestation and forest deg- 
radation, recently estimated to be around 15% 
of global carbon emissions (G. R. van der Werf 
et al. Nature Geosci. 2, 737-738; 2009). 

The latest assessments, presented at Copen- 
hagen, harness data from multiple satellites as 
well as thousands of ground plots, and should 
help governments and other scientists to esti- 
mate how much carbon is locked within trees, 
vegetation and soils on a given patch of land — 
rather than relying on rough averages that are 
calculated across a forest. 

Sassan Saatchi, an environmental scientist at 
NASA’s Jet Propulsion Laboratory in Pasadena,
California, worked on one study with research- 
ers at the carbon consulting firm Winrock 
International in Arlington, Virginia. He says 
that their preliminary calculations (see map) 


accord well with previous estimates. South 
America comes in with about 145 gigatonnes 
of carbon in vegetation and soils, about 26% 
higher than what has been reported by the 
Intergovernmental Panel on Climate Change 
(IPCC). The figures for Africa (51 gigatonnes) 
and south Asia (46 gigatonnes) are about the 
same as the IPCC figures. 


A question of scale 

But Saatchi says that the study provides 
additional information about biomass levels 
at regional and national levels. “You cannot 
really nail down this problem unless you have 
the distribution,” he says. “You need to know 
how biomass is distributed and how it’s chang- 
ing over time, almost everywhere, with some 
resolution and accuracy.” 


THE WORLD'S CARBON STORES 


Funded in part by the World Bank, the work 
provides a snapshot at 1-kilometre resolution 
of tropical forests as they were in 2000, when 
most of the satellite data were collected, as well 
as more recent deforestation trends. 

Also at Copenhagen, researchers at the 
Woods Hole Research Center in Massachusetts 
presented another pan-tropical biomass assess- 
ment, which had a resolution of 500 metres. 
Like the Winrock study, it includes spectral 
data from NASA satellites as well as laser 
measurements of forest canopy height from 
an instrument on NASA’s Ice, Cloud, and land
Elevation Satellite (ICESat) that was designed 
to study polar ice caps. The two teams have yet 
to compare results. 

Richard Houghton, a biomass expert at 
Woods Hole, says that it is good news that mul- 
SOURCE: WINROCK INTERNATIONAL


UK research funding proposal is ‘irresponsible’ 


Efforts to judge science on its
practical returns often raise
hackles, and Britain's latest
plan is no exception. Some of
the nation's leading universities
have condemned a scheme that
would assess the economic and
social benefits of research to
help determine who wins a large
fraction of university funding, and
more than 12,000 academics have
signed a petition opposing the plan.
The public consultation on the
proposals closed this week.

Concerns first arose in
September, when the Higher
Education Funding Council for
England (HEFCE) launched
its proposals for the Research
Excellence Framework (REF),
the successor to the Research
Assessment Exercise (RAE) used
to divide more than £1.5 billion
(US$2.4 billion) per year in public
funds between universities.
Whereas the RAE did not use impact
criteria, the REF, due to begin in 2013,
will require researchers to submit
case studies detailing examples of
the societal and economic benefits
of their research from the past 10-15
years. These examples would help to
determine the fate of almost a third
of the funding. The overall aim is to
make explicit how much benefit the
British taxpayer gets from funding
research.

Ian Leslie, pro-vice chancellor
for research at the University
of Cambridge, considers the
proposals “neither credible nor
responsible”. He says that the
university recognizes that research
institutions need to communicate
the impact of the research they
undertake. But HEFCE’s proposals
would turn “first-rate universities
into second-rate companies”, he
says, adding that it is “irresponsible”
to apportion so much funding on the
basis of the impact of the research.

Similar requirements in the
United States encourage some
researchers to “oversell” the
potential impact of their work,
says William Schowalter, a
chemical engineer from Princeton
University, New Jersey, who was
an international judge in the
final RAE in 2008. This can skew
funding towards fields such as
nanotechnology that promise more
immediate benefits.

Peter Knight, deputy rector for


COPENHAGEN LIVE
Nature's reporters blog
from the United Nations
climate conference.
go.nature.com/OEsfAa


tiple teams are tackling the big-picture ques- 
tion of tropical forest biomass. “We need a 
couple of independent estimates just to see 
how well they match,” he says. “Anybody can
make a map. If they differ, at least it identi- 
fies the areas that need further analysis.” 

The next step, says Alexander Lotsch, a 
geographer at the World Bank in Washing- 
ton DC, is to produce better estimates for 
carbon emissions from deforestation. He 
adds that Saatchi’s research is still a “work 
in progress”. 

Satellites can reliably track deforestation 
and, increasingly, small-scale logging. In 
Copenhagen, Greg Asner of the Carnegie 
Institution of Science in Stanford, Califor- 
nia, and Google.org unveiled an online tool 
that allows tropical countries, beginning in 
South America, to map deforestation using 
an automated system to analyse satellite 
imagery. Asner has also developed a system 
for assessing biomass at finer resolution, 
which will be necessary if forests are going to 
be linked to international carbon markets. 

The new pan-tropical biomass maps from 
Saatchi and Woods Hole won't accomplish 
that goal, but they can provide scientists and 
policy-makers with a better understanding 
of carbon trends. For example, using a simi- 
lar technique to Saatchi, Asner has found 
that deforestation in Brazil is moving into 
higher biomass areas in the interior of the 
forest. That suggests that emissions will 
probably rise over time on a per-hectare 
basis, offsetting some of the reductions in 
deforestation that Brazil aims to achieve in 
the coming decade (see Nature doi:10.1038/news.2009.752; 2009).
Jeff Tollefson 
See also www.nature.com/roadtocopenhagen 


research at Imperial College London, says 
that he wants to delay the REF by a year to 
incorporate the findings of planned pilot 
trials. He recommends that just 15-20% of 
the audit be devoted to impact assessment, 
whereas the University and College Union, 
a trade union for academics, would like the 
impact component to be removed. 

David Price, vice-provost for research at 
University College London, says that any 
assessment of impact should include benefit 
to the academic community, and not just the 
economy and society as currently proposed, 
to ensure that fields such as mathematics 
and social sciences are not disadvantaged. 

HEFCE will publish a summary of the 
responses to the consultation, and its plans 
for the REF, in spring 2010.
Natasha Gilbert 


Hope for Japan's key projects 


When Japan's government changed hands in 
September for the first time in five decades, 
many Japanese people hoped that the 

newly powerful Democratic Party of Japan 
would revitalize their country. But the new 
government has since sent scientists on an 
emotional rollercoaster. In recent weeks, 
two cabinet-level bodies, both chaired by 
Prime Minister Yukio Hatoyama, have 
recommended drastically different financial 
futures for major scientific projects. 

One set of proposals, from the 
Government Revitalization Unit (GRU) 
that was set up in September to trim 
bureaucratic fat, recommends deep cuts 
for many key projects. These include a 
proposed next-generation supercomputer, 
the SPring-8 synchrotron in Harima, and 
Earth-science research. Scientists protested 
against those cuts (see Nature 462, 557; 
2009). But last week came news of a separate 
recommendation from the Council for 
Science and Technology Policy (CSTP), 
Japan’s highest science-policy-making body, 
proposing continued support for those 
projects and many others. 

For instance, the SPring-8 synchrotron 
and the Global Center of Excellence 
programmes — meant to strengthen 
doctoral research programmes — had each 
been headed for cuts of one-third or more, 
but the CSTP says they should be “prioritized 
and given the necessary resources”. The next- 
generation supercomputer, which could have 
faced outright termination, should also be 
supported, it says. 

In Japan, where government decisions are 
usually made in bureaucratic back rooms 
and handed out as a harmonious consensus, 
the apparent contradiction is baffling 
researchers. “The decision-making process 
is unclear,” says Tadashi Watanabe, project
leader for the supercomputer. “It is very 
unsettling.” 

Final budget decisions will be made 
later this month, but the prime minister 
has called the CSTP proposals “valuable 
opinions”, and said that he would “work
to ensure they were reflected in the final 
budget”. Many think Hatoyama could 
be leaning towards accepting the CSTP’s 
recommendations, perhaps because 
of the outcry over the proposed cuts. 

“It’s too early to tell, but you can safely say 
that the top leadership did recognize the 
problem,” says Atsushi Sunami, a science-policy
expert at the National Graduate
Institute for Policy Studies in Tokyo.

Scientists are hoping that Japan's prime minister
Yukio Hatoyama will avert proposed funding cuts.

The GRU was earlier criticized by 
scientists for recommending cuts without 
obtaining sufficient external input; projects 
were usually explained to the decision- 
making committee in a one-hour session 
by a bureaucrat. According to the Japanese 
media last week, the GRU plans to reopen 
debate on the supercomputer project in 
a public forum that will involve many 
scientists. 

For researchers, the near demise of 
beloved projects has been a wake-up call to 
the need to justify them to the public as well 
as to bureaucrats and external evaluators. 
Last week, leaders of the supercomputer 
project posted on their website a list of 
frequently asked questions on the project’s 
significance, including: “What is great 
about the supercomputer?” 

Watanabe says the team doesn't yet 
have a long-term strategy for engaging the 
public, but he wants to emphasize that the 
supercomputer would have a major role 
in Japan’s main research fields, including 
nanoscience, life science and environmental 
science. “We have to get that point across,” 
he says. 

Sunami agrees that scientists in Japan can’t 
take public support for granted. “Even if the 
budget for the supercomputer and SPring-8 
are saved at a smaller scale,” he says, “they
have to engage with the public more.”
David Cyranoski 
Modellers claim wars are predictable


Insurgent attacks follow a universal pattern of timing and casualties. 


Seemingly random attacks and a shadowy, 
mysterious enemy are the hallmarks of insur- 
gent wars, such as those being fought in Afghan- 
istan and Iraq. Many social scientists, as well as 
the military, hold that, like conventional civil 
wars, these conflicts can’t be understood with- 
out considering local factors such as geography 
and politics. But a mathematical model pub- 
lished today in Nature (see page 911) suggests 
that insurgencies have a common underlying 
pattern that may allow the timing of attacks and 
the number of casualties to be predicted. 

“We found that the way in which humans do 
insurgent wars — that is, the number of casual- 
ties and the timing of events — is universal,” 
says team leader Neil Johnson, a physicist at the 
University of Miami in Florida. “This changes 
the way we think insurgency works.” 

Johnson and his colleagues argue that the 
pattern arises because insurgent wars lack a 
coherent command network and operate more 
as a “soup of groups”, in which cells form and
disband when they sense danger, then reform in 
different sizes and composition. The timing of 
attacks, the authors say, is driven by competition 
between insurgent groups for media attention. 

Johnson, who has presented preliminary 
versions of the work to the US military, says 
that the findings allow a glimpse into the heart 
of insurgency behaviour. “We can get a sense 
of what is going on and what might happen 
if we intervened in certain ways,” he says. He
is now working to predict how the insurgency 
in Afghanistan might respond to the influx 
of foreign troops recently announced by US 
President Barack Obama. 


Power law 
The researchers collected data on the timing 
of attacks and number of casualties from more 
than 54,000 events across nine insurgent wars, 
including those fought in Iraq between 2003 
and 2008 and in Sierra Leone between 1994 
and 2003. By plotting the distribution of the 
frequency and size of events, the team found 
that insurgent wars follow an approximate 
power law, in which the frequency of attacks 
decreases with increasing attack size to the 
power of 2.5. That means that for any insur- 
gent war, an attack with 10 casualties is 316 
times more likely to occur than one with 100 
casualties (316 is 10 to the power of 2.5). 
“This is surprising because these wars are all 
fought in different terrains and under different 
circumstances,” says Johnson. “It shows that
there is something going on in the way these
wars are fought that is common to all.”

Could a model help to predict the number of casualties in conflicts such as that in Afghanistan?
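The arithmetic behind the 316-fold figure can be checked with a few lines of Python. This is only a sketch of the scaling relation as the article describes it, assuming that attack frequency is proportional to attack size raised to the power of minus 2.5; the function name and the constant are illustrative and are not taken from the researchers' model.

```python
# Exponent of the approximate power law reported in the article.
ALPHA = 2.5


def relative_frequency(size_a, size_b, alpha=ALPHA):
    """Return how many times more frequent an attack with size_a casualties
    is than one with size_b casualties, assuming frequency is proportional
    to size ** -alpha (an idealized power law)."""
    return (size_b / size_a) ** alpha


# An attack with 10 casualties compared with one with 100 casualties.
ratio = relative_frequency(10, 100)
print(round(ratio))
```

Under the same assumption, any tenfold increase in attack size lowers the expected frequency by the same factor, which is what makes the pattern a power law rather than, say, an exponential decay.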

To explain what was driving this common 
pattern, the researchers created a mathemati- 
cal model that assumes that insurgent groups 
form and fragment when they sense danger, 
and strike in well-timed bursts to maximize 
their media exposure. The model gave results 
that resembled the power-law distribution of 
actual attacks. 

“They show a nice agreement between the 
data and their model, which is an important 
first step,” says Aaron Clauset, who researches
the mathematics of conflict at the Santa Fe 
Institute in New Mexico. But he and others 
question the model’s assumptions, such as the 
number of insurgents in the conflict remaining 
roughly fixed over time. Clauset says that this 
idea does not match with other findings. 

The model also assumes that insurgent 
groups can freely break up then re-form. But 
Roy Lindelauf, who models terrorist networks 
at the Netherlands Defence Academy in Breda, 
notes that some insurgents in Iraq are battling 
each other as well as the coalition forces, and 
would therefore not merge into a single group.

Lars-Erik Cederman, a researcher in inter- 
national conflict at the Swiss Federal Institute 
of Technology in Zurich, adds that the model 
“has the potential to improve knowledge about 
warfare”. But, he says, the authors “go too far in 
claiming they have found a universal under- 
lying pattern” because their work includes 
only nine wars. Cederman, part of a group that 
regards insurgency as similar to general war- 
fare, also says that although terrorist attacks can 
be driven by competition for media attention, 
it remains far from clear whether insurgencies 
have the same motive. 

“In human social systems, it is usually dif- 
ficult to nail down what mechanism is behind 
an observed behavioural pattern,” Clauset says. 
“There are almost always several equally plausible
explanations that need to be considered.”

Johnson agrees that there could be other 
explanations for the pattern his group has 
found. But he says, “We have looked for many 
years for a model, and this is the only one we 
have found that explains the data.” 

Natasha Gilbert 
STEM-CELL INDUCTION
MADE SIMPLER 
Inserting genes at just 
one location induces 
pluripotency. 
go.nature.com/nnitjD 
Consent issue dogs stem-cell approval


The US expansion of federal funding for 
human embryonic stem-cell research is 


being hampered by details in consent forms. 


Earlier this month, researchers 
celebrated the government’s approval of 
funding for a broad variety of work on 
13 stem-cell lines — the first approved 
under the policy announced by US 
President Barack Obama in March. 

But on 14 December, Francis Collins, 
director of the National Institutes of Health 
(NIH) in Bethesda, Maryland, decided 
to respect the unanimous opinion of his 
standing advisory committee and restrict 
NIH funding for an additional 27 lines to 
purposes outlined in the associated consent 
form. He has as yet declined to discuss his 
reasoning. 

The forms signed by the donating 
couples stipulated that cells derived from 
the embryos “will be used to study the 
embryonic development of endoderm with a 


focus on pancreatic formation” with the long- 
term goal of diabetes treatment. Collins's 
verdict could set the tone for decisions 
involving some of the 80 other lines awaiting 
approval at the NIH — and another 242 lines 
that scientists are preparing to submit. 

“This is going to be a recurring issue for 
the NIH: it is going to look 
at consent forms that either 
include restrictive language, 
or failed to include enough 
information about the broad 
array of research possibilities,”
says Robert Streiffer, a bioethicist at the 
University of Wisconsin-Madison. “Either 
of those situations is a problem from the 
perspective of informed consent.”

The cell lines at issue were derived 
starting almost a decade ago by Doug 
Melton at Harvard University in Cambridge, 
Massachusetts. In 2005, responding to a
request from Melton, Harvard’s institutional
review board freed up the lines to be used for
“any legitimate scientific purpose”. The cell 
lines had been anonymized, and the donors 
couldn't be traced for new consent. 

But Mary Beckerle, a cancer biologist 
at the University of Utah in Salt Lake City, 
told Collins at a 4 December meeting of his 
advisory committee that “we 
have to take very literally what 
is written in that informed 
consent”. 

George Daley, a stem-cell 
researcher at Children’s 
Hospital Boston in Massachusetts, says 
that despite the restrictions imposed on the 
Melton lines, federally funded researchers 
will get the cells they need. “Part of the 
advantage of having a policy that will have 
hundreds of new lines is that everybody 
should be able to find lines that are suitable 
for their own work,” he says.
Meredith Wadman

French research wins huge cash boost


President Sarkozy uses ‘big loan’ to push his reform agenda. 


Universities in France are set to receive an 
€11-billion (US$16-billion) windfall from a 
government initiative intended to create an 
‘Ivy League’ of research centres. The cash 
could help to reverse the decades-long neglect 
of the country’s university system, although 
the bulk of the funding is likely to be chan- 
nelled to just a few institutions. This would 
break a long-standing taboo in France, where 
the 83 previously centralized state universities 
have long been considered equal, at least in 
terms of government funding and research- 
ers’ pay scales. 

The funding is part of a €35-billion pack- 
age — the grand emprunt, or ‘big loan’ — 
announced by President Nicolas Sarkozy on 
14 December, intended to boost the coun- 
try’s long-term competitiveness. Borrowed 
mainly from international financial mar- 
kets, the investment represents a hefty 1.8% 
of France’s annual gross domestic product, 
roughly equivalent in scale to the $53-billion 
science stimulus package announced earlier 
this year by the United States (see Nature 461, 
856-857; 2009). 

Sarkozy's spending priorities (see graphic)
broadly follow recommendations made
on 17 November by a blue-ribbon panel of
researchers, industrialists and economists,
chaired by two former prime ministers,
conservative Alain Juppé and socialist
Michel Rocard. Panelist Edouard Bard, an
oceanographer at the Paul-Cézanne University
in Aix-Marseille, says he is pleased
that Sarkozy heard the panel’s argument that
increasing the international competitiveness of
France’s universities was the grand emprunt’s
top priority. The funding should give universities
the freedom and funding to attract talent
from around the world by offering more
competitive salaries and generous lab funding,
he says.


Pillars of excellence
The stimulus funds will be used to create
five to ten campuses of research excellence,
each of which will receive an endowment of
up to €1 billion. The scheme is partly modelled
on Germany’s multibillion-euro Excellence
Initiative, says Bard (see Nature
doi:10.1038/nature08269; 2009).

A further €1 billion was awarded, on top of
€850 million allocated last year, to a project to
transform the Saclay plateau just outside Paris,
one of the highest concentrations of research
infrastructure in Europe, into a supercampus
by melding many institutions into a single body,
fulfilling one of Sarkozy’s campaign pledges
from the 2007 presidential election (see Nature
446, 847-850; 2007). Other universities will,
however, be eligible for other funding, including
€1 billion to hire leading researchers or buy
equipment to create ‘labs of excellence’.

The windfall continues the reforms of a 2007
law that aimed to make universities more competitive
by freeing them from central government
control and allowing them to manage
their own budgets, staff, salaries and buildings.
It also reinforces Sarkozy’s plan to shift research
power away from the national research agencies
and towards the universities, and follows the
creation in 2007 of a national research council,
which awards competitive grants on the basis of
peer-reviewed proposals. Most
labs had previously operated on
a recurrent stream of government
funding.

Yet French universities often
lack the agencies’ expertise
in managing large research
projects and funds, says
Philippe Froguel, a French geneticist working 
at Imperial College London. Most administra- 
tors of French universities, he says, are not in 
the same league as the heads of major US and 
UK universities. They will need to develop such 
expertise quickly if the universities are to spend 
the new funds well. Arnold Migus, director- 
general of France’s CNRS, the largest national 
funding agency for basic research in Europe, 
notes that most university labs are joint CNRS 
labs, and that the agency is helping universities 
to get up to speed. 

Some French researchers say that the new 
funds mask cuts in staff and research else- 
where. The “so-called massive investment” is 
a “mirage”, says Bertrand Monthubert, spokes- 
man for higher education and research of the 
opposition Socialist party, and a mathemati- 
cian at Paul Sabatier University in Toulouse. 
He argues that the endowments’ annual yields, 
which are vulnerable to the economic climate, 
would be similar to the annual budget rises that 
the universities normally receive. 

Froguel applauds concentrating the funding 
on fewer players, but fears the money will flow 
only to the largest institutions. “Big is not always 
beautiful,” he says. “It will totally ignore the
pockets of excellence in smaller universities.” 
He favours rewarding the best departments or 
research networks wherever they are, much as 
Britain’s Research Assessment Exercise does 
(see page 834).
Declan Butler

Watching science at work


A network of social scientists in the United Kingdom is seeking better ways to study the work of
biologists. But, asks Colin Macilwain, can it earn its subjects’ trust?


There was something of a chill in the air
at Cardiff’s City Hall in October, and 
not just because autumn was arriv- 
ing. Social scientists, and some life 
scientists, were gathering there for the 
annual meeting of the Genomics Network, 
a programme run by Britain’s Economic and 
Social Research Council (ESRC) to stimulate 
dialogue between the disciplines. The plenary 
talks got under way, led by 
Christine Hauskeller, a phi- 
losopher with the network 
from the University of Exe- 
ter, and Martin Evans, win- 
ner of the 2007 Nobel Prize 
in Physiology or Medicine 
for his development of gene 
knockout technology. But 
when the initial call for ques- 
tions sparked little real discussion, it was clear 
that dialogue was going to take some stimulat- 
ing. Then Evans was asked what he thought of 
his hosts. “They like to say, why are we doing 
things?” he growled. “We should be asking, 
why are they doing things?” 
The Genomics Network started its work 
back in 2002. The previous year, when the 
British government had said it would earmark
an additional £200 million (US$290 million in 
2001) to genomics research, the ESRC success- 
fully argued that about £9 million of it should 
go to the investigation, by social scientists, of 
genomics issues and the scientists who study 
them. Seven years after its creation, the net- 
work, which supports about 100 researchers 
at five universities, is one of the largest projects 
of its kind in the world, and has broadened its 
interests beyond genomics 
to embrace synthetic biology 
and other areas. Two years 
ago, the ESRC announced a 
further £18 million in fund- 
ing for three of its centres after 
peer review. The money was 
for a second, and final, five- 
year term: the ESRC doesn't 
support permanent centres of 
excellence, on the grounds that societal chal- 
lenges are always changing. 

The centres have a very broad scope. 
Cesagen, the Centre for Economic and Social 
Aspects of Genomics, is the largest of the 
groups, and is co-hosted by Cardiff Univer- 
sity and Lancaster University. The University 
of Exeter is home to Egenis, which uses philosophy-based
approaches to study genomics
questions. Innogen, which studies innovation
in genomics and the life sciences, is co-located 
at the University of Edinburgh and the Open 
University in Milton Keynes. And the Genom- 
ics Forum, also at Edinburgh, was set up in 
2004 to help coordinate the Genomics Net- 
work and push its findings into wider political 
and public arenas. 


Embedded in the community 
Unlike some previous attempts by sociologists 
to ‘study’ scientists at work, in these centres 
the social scientists are organized into multi- 
disciplinary teams — often including lapsed 
natural scientists, as well as sociologists and 
philosophers — with funding to do empirical 
research. And the researchers embed them- 
selves deeply in the community of natural 
scientists that they are seeking to study. 
Three-quarters of the way through the centres’
ten-year lives, their track record is mixed. They 
have provided a stable and conducive environ- 
ment for social-sciences research, much of their 
work is undoubtedly original and some of it has 
made its impact felt in policy circles, influenc- 
ing debates on the legislation of animal-human 
hybrid embryos, for example, and on innovation 
at the Organisation for Economic Co-operation
and Development (OECD) in Paris. “I think 
that we've really added value by having these 
centres,” says Ruth Chadwick, the director of 
Cesagen and chair of the ethics committee at 
the international Human Genome Organisa- 
tion. “The ESRC made it clear from the outset 
that it wanted to see interactions with hard 
scientists. And we've seen increasing open- 
ness to such collaboration: in part because the 
funding agencies demand it.”

Well-trained social scientists can play a much 
bigger part in helping scientists to build bridges 
with the outside world, says Grahame Bulfield, 
former head of science and engineering at the 
University of Edinburgh and an early champion 
of the two centres there. “We need organizations 
that will study the interactions between science 
and society in a scholarly way, and interpret 
their findings for the public,” he says. 

But some critics fault the ESRC for failing 
to provide sufficient direction for the network 
since its foundation. Its staff case officer, Liz 
Grassby, is the fourth or fifth official to hold 
that position since the network was conceived. 
“The quality of the research output has not 
been as good as you might have expected,” 
said one senior social scientist from outside 
the network. “Much of the problem can be 
laid at the door of the ESRC; for whatever rea- 
son, it has high staff turnover and no collective 
memory. The initiative could have been better 
managed.” 


Chinks in the wall 

The relationship between natural and social 
scientists has, historically, been more fraught 
than fruitful. Scientists are often prickly about 
being studied by outsiders such as sociologists 
or historians of science. 

Lately, however, some chinks have appeared 
in the wall that separates the two realms. 
Brian Wynne, a sociologist, former physicist 
and associate director of Cesagen, based at
Lancaster, says that he has noticed profound 
changes over the past decade in the natural sci- 
ences’ receptiveness to social science. “Social 
scientists have become welcome, and indeed 
essential, partners with the natural sciences,”
he says. “We've become embedded, like the
media folks in the Iraq war.”

Wynne, for example, has been involved in 
planning a citizens’ science programme at the 
Natural History Museum in London, which 
seeks, among other things, to integrate huge 
banks of data collected by amateur naturalist 
groups into the museum’s study of biodiversity.
Scientists such as Johannes Vogel, keeper of 
botany at the museum, have 
built on Cesagen’s work on pub- 
lic engagement to help draw 
on this amateur expertise. For 
example, the detailed observa- 
tion of river conditions by fly 
fishermen led UK regulators to 
revise their criteria for measur- 
ing water quality. 

In a separate but related 
project, Vogel, Wynne and two Cesagen 
researchers, sociologist Claire Waterton and 
anthropologist Rebecca Ellis, have been con- 
tributing to the Barcode of Life initiative, which 
seeks to build tools that will enable biologists to 
identify species from short stretches of DNA. 
Here, the social scientists have been mediat- 
ing between people who have their own genetic 
methods for species identification (such as 
public-health officials checking different 
strains of mosquito) and the larger interna- 
tional project, which needs global standards 
for genetic bar-coding. As a result, Wynne says,
the whole project is embracing standards, such 
as on what gene segments to use, that better 
incorporate existing approaches. 

It was James Watson who pioneered the large-scale,
systemic involvement of non-scientists in
the life sciences when, as associate director for
human genome research at the US National
Institutes of Health in 1988, he casually suggested
that about 3% (later 5%) of all Human
Genome Project funds, about $10 million per
year since then, should go to the investigation
of the Ethical, Legal, and Social Issues (ELSI)
involved in the Human Genome Project.

[Photo caption: John Dupré (front row, centre) and his team at Egenis study the social and philosophical issues of genomics.]

The ELSI programme now serves as the 
dominant global model for this sort of social- 
science effort. But it also became synonymous 
with poor relations between the observers and 
the observed. From the scientists’ point of view
“there were two main frustrations with ELSI”,
says Robert Cook-Deegan, director of the
Center for Genome Ethics, Law
and Policy at Duke University
in Durham, North Carolina.
“One was its association with
what I call ‘finger-wagging ethics’:
telling researchers how
they ought to conduct their
business. The other was the
way that it created a constitu- 
ency that wanted grant money 
more than it wanted to go out and help solve 
real problems.” 

Ros Rouse, the ESRC programme officer who 
built the Genomics Network and is now head of 
policy at Research Councils UK, the umbrella 
group for the seven UK research councils, 
says the ESRC wanted the Genomics Network 
centres to address “a totally different terrain”
from the ELSI programme, including original 
research into the sociology of biology and med- 
icine, and the nurturing of better links between 
science, the public and policy-makers. 

“This is more of a serious attempt to engage 
with the life sciences than the original ELSI, 
which was seen as an ‘add-on’ to the genome 
project,” says John Dupré, a philosopher of
biology who runs Egenis. Dupré is particularly 
interested in the way that scientists continue to 
work with a ‘tree of life’, the representation of
species relationships to each other in a branch- 
ing tree, even though genomic data challenge 
it. Genome sequencing has shown that bacteria 
and other prokaryotes have swapped genes so 
extensively that their evolutionary histories 
cannot be represented on a conventional
phylogenetic tree.

Dupré and his colleagues are now working 
with philosophers, evolutionary biologists and 
others to develop other means, such as webs or 
grids, to represent organisms’ genetic relation- 
ships. Ford Doolittle of Dalhousie University 
in Halifax, Canada, is one of the biologists 
most closely involved. “I think that it’s been 
very useful because biologists don’t think very 
much about philosophy,” he says. “We impose
patterns on nature for philosophical reasons, 
and then deny that philosophy is important.
Philosophers care about the structure of
arguments and it is a very good exercise for
scientists to start looking at this too.”

Researchers at Innogen have been examining 
what Joyce Tait, a former chemist and the 
first director of Innogen, calls the “innova- 
tion triangle” of the pharmaceutical industry, 
the regulators and the consumer. Tait and her 
colleagues say that the speed and nature of 
innovation in some sectors, such as pharma- 
ceuticals and agricultural biotechnology, are 
determined largely by the regulatory appara- 
tus. They argue, in particular, that it is not the 
severity of a regulatory system, but rather its 
ability to discriminate in how it treats large and 
small companies, that has the greatest bearing 
on rates of innovation. 

These findings are being absorbed in high 
places. The OECD, for example, used an 
Innogen study about the future of the pharmaceutical
industry as the basis for the health
component of a report, The Bioeconomy
to 2030, that it published in June. Given the 
OECD’s considerable prestige, the report is 
likely to exert a strong influence on govern- 
ment approaches to drug regulation around 
the world. 


Broad reach 
The ESRC always wanted the Genomics 
Network to reach out beyond academia to 
public and policy circles, and the Genomics 
Forum was added with this in mind. “If you 
have a social-science centre, its prime emphasis
is going to be on getting its best work published,”
says Steven Yearley, a sociologist and
director of the forum. “What the forum does is 
to accept this and make sure that we take that 
work to a broader audience.” 

Research teams at the network centres, including
Jane Calvert at Innogen and Emma Frow at the
Genomics Forum, are involved in helping to
shape UK participation in the field of synthetic 
biology — the search for approaches to build 
novel biological systems and even entire organ- 
isms. The concept of artificial life is expected 
to stir strong public passions, and those study- 
ing it have turned to social scientists from the 
outset. Some argue that if synthetic biology is 
to get off the ground, the 
working dynamic between 
biologists and engineers 
needs to be examined and 
improved. Calvert and Frow 
have been studying some of 
these questions, and helping 
scientists and organizations 
such as the Royal Academy 
of Engineering in early pub- 
lic consultations on synthetic biology. 

Alistair Elfick, a medical engineer at the 
University of Edinburgh, is joint leader of a 
UK-wide synthetic-biology network that is 
looking at the technical standards that may 
be needed if approaches to synthetic biology 
— such as the design of DNA ‘biobricks’ that 
can be pieced together like Lego — are to be 
successfully pursued. He says he appreciates 
the social scientists’ perspective. “Having their 
insight will be hugely valuable to us,” he says, 
adding that the nascent discipline is serious 
about working with social scientists on public 
engagement. “It’s a matter of entering into a 
dialogue with the public about what it wants us 
to do. We need to have the authority of society, 
in order to proceed.” 

But does the social study of science need to
have this kind of practical utility? Some social
scientists caution that producing work that is
useful to policy-makers, scientists or the public,
as the Genomics Network has set out to do,
is not always consistent with their core scholarly
activity of seeking to better understand how
science works. Paul Martin, a medical sociologist
at the University of Nottingham who is not part
of the network, warns that the desire to make
work fit the needs of policy-makers can create
conflicts of interest, in which those who are trying
to objectively study science and innovation
end up being part of the scientific and innovative
process. “We want to be engaged, but not
reduced to a handmaiden’s
role for new technologies,”
he says.

[Photo caption: Social scientists are tackling concerns raised by research to build systems from ‘biobricks’.]

Nik Brown, a sociologist 
at the University of York 
who is also outside the net- 
work, worries that policy- 
makers and scientists can 
expect the wrong things 
from social scientists. 
“There's a misperception that our main role is to
ease the interaction between scientists and the
public,” he says. “What we want to do is understand
the science, and how it is constructed,
which is not a public-understanding-of-science
question.”

The biggest practical challenge for the 
network, though, is the interactions between 
the scientists and social scientists themselves. 
One senior researcher, who knows the Genom- 
ics Network well, said privately that social 
scientists still have difficulty speaking a lang- 
uage that scientists can relate to. “There’s still a 
huge credibility gap,” according to this observer.
“Their methodologies just don't align well.” 

The issues that interest social scientists 
most are not always the top concern of many 
research scientists, who are busy with funding, 
publications and the science itself. “Scientists 
don’t especially want to know whether their 
work produces an ethical dilemma,” says Tony
Woods, head of medicine, society and history 
grants at the London-based Wellcome Trust, 
the medical-research charity. “Questions about 
societies’ concerns don't always weigh heavily 
on their minds.”

Asked whether the network has succeeded
as a whole, Jim Stevenson of the University of
Southampton, a psychologist and member of
the ESRC’s strategic research board, equivocates
somewhat: “I think now it is working well,
but it has taken a while.” He says that the centres
need to persevere. “We've got to get to a situation
where science, and industry, takes this work
seriously,” he says. “But it’s a slow process.”
Colin Macilwain is a writer based in
Edinburgh, UK.

The genome finishers


Dedicated scientists are working hard to close the gaps, fix the errors and finally complete the human genome sequence. Elie Dolgin looks at how close they are.


From her windowless fifth-floor office at
the US National Institutes of Health in 
Bethesda, Maryland, Deanna Church 
has few distractions from the job that 
lies before her. On her computer sit 888 open 
‘tickets’, or outstanding problems with the
human genome sequence. Although that 
number fluctuates, it’s a not-so-subtle reminder 
that she and her team at the National Center for 
Biotechnology Information (NCBI) have a long
way to go to finish the job started nearly two 
decades ago by the Human Genome Project. 
This is the same project that an international 
team of scientists spent close to US$3 billion on 
to complete. In 2000, the scientists announced, 
to much fanfare at a White House ceremony, 
that they had finished the draft sequence of 
the human genome. They waxed poetic about 
opening ‘evolution’s lab notebook’ when
they published the draft the next year.
And they uncorked champagne 
bottles again in 2003 when the 
sequence was officially 
deemed finished. By
then, media outlets were 
reporting the develop- 
ments with a twinge of 
fatigue. “This time it is the real 
thing, scientists promise,” New Scientist
reported. Another year passed before the final 
analyses were published, and two more went
by before the paper detailing the last, fully pol- 
ished chromosome came out in 2006 (ref. 4). 
Still, three years later, Church is hunched 
over her computer, clicking away at her mouse, 
quietly clearing up the lingering troubles with 
the iconic sequence. Some of her tickets, submitted
by her collaborators and users from
around the world, are reports of missing bits. 
Others describe stretches in which someone 
thinks the sequence is mistaken. Still others 
are unique and unexpected challenges, such as 
complex DNA rearrangements, that could take 
years to sort out. 

“It’s a frustration,” says Richard Gibbs, direc- 
tor of the Human Genome Sequencing Center 
at Baylor College of Medicine in Houston, Texas. 
“It’s an extremely high-quality genome. It's the 
best there is, period. The problem is that a very 
small percentage of uncertainties still translates 
into a significant number of problems.” 

Church and her colleagues are working 
to build a solid, accurate reference, but their 
efforts have revealed how slippery that concept 
can be. The sequence, for instance, does not
represent any one person's genome. It is an
amalgam of DNA from different
people, both male and female. It
was put together this
way to maintain anonymity
for those who
contributed the DNA
and to ensure that the
sequence represented
all humanity: “our
shared inheritance”, as the then-head of the
project, Francis Collins, said.

But that shared inheritance is hard to capture. 
The genomes of two individuals look less alike 
than many had originally assumed. Rather than 
following a linear path of 3 billion base pairs 
with a letter changed now and then along the 
way, human genomes detour into hundreds of 
vastly different stretches in which, depending 
on the individual, millions of base pairs can be 
deleted, inserted, repeated or inverted.

A finished reference genome, if attainable,
will therefore look very different from the
project’s first renditions. That’s where Church 
and her team of finishers come in. They are 
striving to smooth out the differences and to 
develop a more dynamic platform that can 
capture much of humanity’s commonalities 
and uniqueness. Some say it’s a wasted effort 
now that individual human genomes can be 
sequenced at a fraction of what it cost ten years 
ago, but most say the reference is invaluable as 
a bedrock to support the sequencing of future 
human genomes. 

Resolving the problems in the sequence 
will not win Church many accolades. She 
won't meet the president or land any papers 
in high-impact journals as those who “fin- 
ished” the genome before her did. And once 
she puts a ticket to rest, there’s always another 
one waiting. “It’s not sexy,” she says. “But it’s 
important.” 


A coalition of the responsible 

By April 2003, the sequencing had surpassed 
the international project's technical definition 
of completion — the sequence contained fewer 
than 1 error per 10,000 nucleotides and cov- 
ered 95% of the gene-containing parts of the 
genome. But there were still errors — around 
350 gaps in the sequence — and much of the 
structural variation was not included. 

In 2004, Church and a few dozen researchers 
met to discuss genomics and structural vari- 
ation at the Wellcome Trust Sanger Insti- 
tute in Hinxton, UK. One complaint was 
echoed repeatedly: there was no easy way 
to fix or update the genome with new data. 
In the 1990s, when sequencing was in full
swing, researchers could contact the specific
chromosome curators at each of the major
sequencing centres involved with the project
to report any sequencing slip-ups. But by 2004,
few of the centres involved were actively monitoring
their slices of the genome and there was
little scientific impetus to revisit old work. This
posed a problem. “Someone needs to have
responsibility for the genome so that if errors
are found, improvements can be made,” says
Adam Felsenfeld, a programme director at the
National Human Genome Research Institute
(NHGRI) in Bethesda, Maryland.

THE REMAINING GAPS
In March 2009, the Genome Reference Consortium released its first assembly of the human genome that had originally been completed by the Human Genome Project.
The sequence closed 25 gaps and added alternative versions at three complex regions (red arrows). The sequence still contains nearly 300 gaps (general locations indicated
by black arrows) that range from 700 to 30 million base pairs long, not including parts within the telomeres and centromeres (green) that are intractable to sequencing.

Together with Ewan Birney of the European 
Bioinformatics Institute (EBI) in Hinxton, 
Church appealed to the NHGRI and the Well- 
come Trust for funding. It took more than 
two years of meetings and deliberations, but 
eventually the NHGRI agreed to set aside up 
to US$1 million in operating 
funds from an annual large- 
scale sequencing award 
(more than $30 million per 
year) to Washington University
in St Louis, Missouri.
Sanger and the Wellcome Trust 
agreed to a similar amount, and 
the EBI and NCBI handle the 
informatics as part of their 
normal operations. The col- 
laboration, known as the Genome Reference 
Consortium (GRC), is now the epicentre for 
genome improvement. 

To improve the reference, the GRC is 
concentrating on three main goals: to cor- 
rect assembly errors; to fill in the genome’s 
remaining gaps; and to produce alternative 
sequences for regions of the genome with 
extensive variability. 

Researchers have had the first two objectives 
on their agendas since the Human Genome 
Project was concluded, and they have been 
chipping away at them ever since. Some of 
the regions have been particularly difficult to
polish off. For some repetitive stretches, for
example, researchers struggled to make mul- 
tiple copies in bacteria — a necessary part of 
the sequencing process. But newer methods 
are now allowing them to fill in these pieces 
of the genome. Earlier this year, a team led by 
Chad Nusbaum, co-director of the Genome 
Sequencing and Analysis programme at the 
Broad Institute in Cambridge, Massachusetts, 
used a next-generation sequencing technol- 
ogy that does not require bacteria to amplify 
the DNA. Nusbaum’s team then handed the
sequences over to the GRC, which incorpo- 
rated them into the reference assembly. 

The third goal reflects something that has 
only recently come to light. At first, researchers 
assumed that genetic variation between people 
largely consisted of differences in single DNA 
letters. Now, however, they better understand
the extent of structural variations, including
deletions, insertions, duplications
and inversions. Although some of these
variants are involved in herita- 
ble disease, they are much more 
difficult to keep track of than 
single-letter differences because 
they often don't map easily to 
the reference. So instead of representing
the genome as a single
path of three billion letters, the 
GRC is introducing alternative paths to reflect 
its diversity. 


Twists and turns 

One such region is the major histocompatibil- 
ity complex (MHC) — a 4-million-base-pair 
stretch of chromosome 6 that contains many 
immunity-related genes and is recognized as 
one of the most variable slices of the human 
genome. The original reference sequence was 
a hotchpotch of multiple blocks of DNA, called 
haplotypes, taken from several donors, so the 
sequence that resulted didn’t actually exist in 
any real person. To create a reference with
clearer origins, a team led by Stephan Beck of
the University College London Cancer Insti- 
tute sequenced a single MHC haplotype. They 
then compared it against seven other common 
European haplotypes and discovered more 
than 37,000 single DNA letter differences and 
around 7,000 structural variations — a level of 
genetic diversity about an order of magnitude 
greater than the genomic average. Beck’s team’s
reference has now been swapped into the GRC’s 
default sequence and the other seven haplo- 
types are included as alternative pathways. 

Two other regions also have substitute 
haplotypes. One sits on chromosome 4 around 
the gene encoding the UGT2B17 enzyme, which 
metabolizes steroid hormones and many drugs. 
The ‘finished’ reference had misassembled two 
haplotypes, introducing a false gap. A corrected 
assembly found that the ‘gap’ was actually a dele- 
tion found only in some people, flanked by large 
duplications. That section is now included as an 
alternative pathway in the GRC reference. 

The other region, on chromosome 17, 
encompasses the MAPT gene, and provides a 
case study of the limits of the original refer- 
ence sequence, which consisted of only one 
haplotype. An alternative haplotype, a com- 
plex inversion of the first that is found in 20% 
of Europeans, was shown in 2005 to correlate 
with larger family sizes, suggesting that this 
second haplotype was under some form of 
positive selection. But in 2006, Evan Eichler,
a geneticist at the University of Washington in
Seattle, and two other groups showed that
the inverted region was also prone to frequent 
spontaneous deletions that led to mental retar- 
dation. The inverted haplotype seemed to be 
both adaptive and the source of debilitating 
deletions. “You have a yin for the yang here 
determined by genomic structure,” Eichler 
says. “The question was, ‘What’s going on?’”

To answer that question, Eichler needed the 
sequences. He teamed up with Michael Zody, 
chief technologist of the Genome Biology 
Estimates suggest that more than 400 loci on the human genome
will require alternatives in the reference genome to capture a 
reasonable amount of human diversity. So far, three have been 
included in the reference, ranging from 590 thousand base pairs 
(kbp) to 4.93 million base pairs (Mbp). 


Chromosome 4 

Contains the gene UGT2B17. The 
original assembly of the genome
sequence merged information from two
haplotypes. Now, both versions of the 
region are available for reference. 


Chromosome 6 

Contains the major histocompatibility 
complex, which is one of the most variable 
regions in the genome. The locus 
surrounding the complex contains more than 
100 genes, most of which are involved in 
immune responses. One version of this locus 
is the standard, with seven alternative 
versions included in the latest build of the 
genome sequence. 


Chromosome 17 
Contains the gene MAPT. Variants in the region around this gene have been associated with traits such as number 
of offspring and mental retardation. Researchers have shown that some individuals have large inversions of the 
DNA sequence in this region, as well as insertions and deletions, so two versions are provided. 


Program at the Broad Institute, to resequence 
the whole region and showed that the archi- 
tecture of the inverted haplotype predisposed 
the sequence to undergo deletions associated 
with mental retardation. By the time Eichler
and Zody published their results, in 2008,
the GRC was already in full swing and gear- 
ing up to release the next build of the genome. 
The researchers handed over their sequences 
to the consortium, and both haplotypes were 
included in the reference sequence. “The GRC 
provided a central clearing house for us to go 
through,” Zody says. 

Given the clinical relevance of these and 
other complex regions, providing multiple 
references is essential to detect the mutations 
underlying many diseases, says Eichler. “Once 
we get the alternative structures worked out, I 
believe we'll be able to make disease associa- 
tions that were previously impossible,” he says. 
Eichler estimates that around 5% of the genome 
— corresponding to around 400 specific loca- 
tions — will need alternative sequences to pro- 
vide a platform that adequately captures the 
spectrum of human diversity. These regions 
include more than 1,000 genes that affect a 
wide range of physiological processes, includ- 
ing immune responses, drug detoxification and 
reproductive ability, he says. 


A common task

The GRC’s first public offering — a more 
accurate version of the human genome — 
rolled out online in March 2009, updated the 
three parts of the genome with the alternative 
assemblies, corrected more than 150 alignment 
problems and closed 25 sequencing gaps. But 
that still leaves more than 300 gaps. In Septem- 
ber, 20 of the GRC’s core members gathered in 
Hinxton for the group’s twice-yearly meeting to 
discuss what steps to take next. While lab work- 
ers were clacking away on their keyboards, bio- 
informaticians were working through one of the 
consortium’s most contentious issues — how to 


change the reference genome to display only the 
‘common’ gene variants. The GRC’s nine-member
scientific advisory board, which includes
Eichler and Gibbs, has recommended that, 
wherever possible, the genome should include 
the common versions of the DNA sequence. But 
it hasn't defined what ‘common’ means. Should 
it be the highest-frequency variant or some- 
thing that is shared by a reasonable propor- 
tion of the population? Should it be calculated 
across the world’s six billion-plus residents or 
just in a particular ethnic or geographic group? 
Results emerging from the 1000 
Genomes Project, a major inter- 
national sequencing effort to 
catalogue human genetic 
variation in around 2,000 
people from across four 
continents, should 
inform their decisions. 

Some of the GRC 
members disagree with 
making such fundamental 
changes to the reference sequence. “I don't 
think we should be going through and flip- 
ping single bases throughout the genome,” 
says Paul Flicek, who leads the vertebrate 
genomics group at the EBI. “Informatically, 
it just doesn’t matter. As long as it works, I 
think it’s okay.” 

Others outside the GRC question whether 
the entire project is justified. Why bother tink- 
ering with a decade-old reference, asks Lincoln 
Stein, a bioinformatician at the Ontario Insti- 
tute for Cancer Research in Toronto, Canada. 
He calls the effort “more of an abstract exer- 
cise than one that’s going to have a practical 
impact”. Church, for her part, waves off such 
criticisms as being from those preoccupied 
with large-scale genomics. As a detail-oriented 
person, she knows that the little things count. 
Individual investigators love their pet genes. 
That's one reason her queue of tickets is always 
full. And as genomics increasingly moves to 
the forefront of personalized medicine, many
regions of clinical utility might slip through the 
cracks. For researchers interested in a particu- 
lar disease-relevant locus “it doesn’t matter that 
the genome may be 99% complete”, she says. “If
they’re [working with] a region that’s incom- 
plete and wrong, they're screwed.” 

And so the GRC continues with its quiet 
quest, crossing Ts and sometimes changing 
them into As, Cs or Gs. Until a reference is 
no longer needed to assemble the DNA com- 
ing from current sequencing technologies, it 
will continue to document
the evolving understanding
of human variability in
the reference genome.
The GRC has also taken
on the mouse sequence 
and will take responsibility 
for the zebrafish sequence in 
2010. Although it may not 
capture headlines, most in the 
research community recognize its worth. “The 
GRC has exactly the right idea,” says Jonathan
Sebat, a geneticist who studies structural varia- 
tion at Cold Spring Harbor Laboratory in New 
York. “It’s a no-brainer that someone would be 
needed to clean up the mess.”
Elie Dolgin is assistant news editor for Nature 
Medicine in New York. 
Out of service


Decaying infrastructure is an urgent threat that scientists and 
engineers must help to address, says Colin Macilwain. 


The trappings of our civilization, from
flushing the toilet to posting flip comments
on Twitter, rely on a set of critical
infrastructures. Many of these — water
systems, transport links, electricity grids and
generating plants — are ageing severely in
developed countries. And the ones that aren't
ageing, such as mobile communications and
the Internet, are of unknown resilience.

The sudden collapse of a bridge on Interstate
35 in Minneapolis, Minnesota, on 1 August
2007, like the cascade power failure that swept
the northeastern United States four years
earlier, was a portent of what could come to
European and Asian nations if they allow their
physical infrastructure to deteriorate.

The US power failure also showed the
extent to which rich nations’ infrastructure
has evolved into a complex web of
interdependence, which no one has sought to model
properly and for which no authority has overall
responsibility. Energy supply, for example, is
critical to the operation of all the other
infrastructures and is itself dependent on water
supply and telecoms.

“The major change over the last 50 years has
been the gradual, but ultimately seismic, shift”
to an interconnected national infrastructure,
where “failure in one part has a direct and
damaging knock-on effect in others”, noted a
scathing report on the topic published earlier
this year by the UK Council for Science and
Technology (CST), the senior science-advisory
body to the British government.


Under pressure 
The problem will be further compounded by 
global warming. Even before climate change 
starts stressing existing infrastructure to the 
limit, the need to cut carbon emissions will 
transform utilities’ priorities. “If you look at 
most water companies, for example, their 
biggest bill is electric power. We now have to 
rethink that,” says Paul Jowitt of Heriot-Watt 
University in Edinburgh, the president of the 
London-based Institution of Civil Engineers. 
Economists, civil engineers and other 
infrastructure buffs fear that it will take a 
series of massive failures, akin to the US inci- 
dents, for people to sit up and take notice. In 
the meantime, they hope that scientists and 
engineers can help to address the problem by 
demonstrating the hazards posed by the inter- 
dependency of networks, and by working with 
regulators and government departments on
more advanced technological approaches to 
infrastructure repair and maintenance. 

Britain is first in line to confront some 
aspects of this impending collapse because 
parts of its sewers, water system and railways 
date back to the early nineteenth century. The 
country’s problems are compounded by the 
privatizations of the 1980s, which transferred 
the national infrastructure from cumbersome 
but technically competent state bureaucracies 
to profit-driven entities. None of these firms 
has a stake in the 50- to 75-year timescales 
over which infrastructure elements show their 
worth, and many of them have since jettisoned 
research and development to save money. 

In Britain, according to the Organisation 
for Economic Co-operation and Development 
(OECD), investment in water, gas and electric- 
ity infrastructure fell from 0.9% 
of gross domestic product (GDP) 
in the 1970s to 0.5% in 2000-06. Some places,
such as South Korea and Israel, have maintained
higher spending levels of three or four times
as much. But most, including such dirigiste
nations as France, have seen infrastructure
spending fall: in the OECD as a whole, it slipped
from 1.7% to 0.8% of GDP over the same period.

All around the world, civil engineers,
environmentalists and the business lobby are trying
to push infrastructure decay on to the public
agenda. And at senior government levels,
awareness of the issue is growing. Australia —
like Britain, an early pioneer of privatization
— passed legislation last year to set up a body
called Infrastructure Australia to set priorities
and oversee a Aus$20-billion (US$18.2-billion)
investment fund. It swung into operation
quickly, and this May published a comprehensive
set of national priorities.

Also in May, an act proposing a National
Infrastructure Development Bank was brought
to the US Congress — but it has made no 
progress. President Barack Obama's stimulus 
package spread spending very widely, and did 
little to directly address a US infrastructure 
spending ‘gap’ that the American Society of Civil 
Engineers estimates at a cool US$1.2 trillion 
over the next five years. “We've made a lot 
of progress at getting the problem onto the 
agenda, but not much progress in solving it,” 
says Blaine Leonard, the society's president. 
In Britain, Prime Minister Gordon Brown 
pledged in July to address one of the CST 
report's central recommendations by estab- 
lishing a coordinating body, Infrastructure UK. 
Details of its remit have not been announced. 


Model future 

Engineers and scientists, says Leonard, should 
“prepare to rebuild our crumbling infrastruc- 
ture with new materials and new technologies, 
in ways that are more resilient and more sus- 
tainable”. Better models are also needed to iden- 
tify critical failure paths in infrastructure, says 
Brian Collins, chief scientific adviser to the UK 
transport and business departments and a co- 
author of the CST report. He says that research 
groups at the universities of Bristol and War- 
wick, and at the London School of Economics, 
have ideas on modelling complex systems that 
might prove useful. 

The UK Engineering and Physical Sciences 
Research Council, which funds two of these 
groups, is now pushing national infrastruc- 
ture as an ideal target for their approaches 
to modelling. The objective is to identify the 
most critical of the hundreds of linkages that 
exist between the different net- 
works. 

So where the money will come 
from to fix these things is any- 
body’s guess. With some excep- 
tions, the institution-building 
that has taken place has yet to make an impact 
in spending decisions. Private capital for infra- 
structure is harder to raise than ever, and most 
observers fear that public investment plans will 
collapse after the recession, as they are easier to 
cut than current expenditure. Dieter Helm, an 
economist at the University of Oxford, UK, esti- 
mated in a September report for the think tank 
Policy Exchange that Britain alone will need to 
raise about £500 billion (US$815 billion) for 
infrastructure over the next ten years. 

“You need a crisis first,” Helm says. “What
you need is for it to visibly start to fall to bits. 
Ultimately, people will recognize that what we 
need to be investing in is not consumption but 
infrastructure.”
Colin Macilwain is based in the United Kingdom. 
e-mail: cfmworldview@gmail.com 
See go.nature.com/ILx8PC for more columns. 
Iran's scientists
condemn instances 
of plagiarism 


The Iran chapter of the Academy
of Sciences for the Developing
World, speaking for the country's
academic community, deplores
the recent cases of alleged
plagiarism by Iranian scientists
(see Nature 462, 704-705; 
2009). 

Iran's scientific community 
is largely free of such unethical 
behaviour. The calibre of its 
scientific output is reflected by 
the substantial growth in recent 
years in its share of research 
articles published in high-quality, 
peer-reviewed international 
journals. 

Several factors account for this 
improvement in Iran’s research
output, including sustained and 
generous government support for 
science, a swelling of the ranks of 
young researchers and increasing 
international collaboration. 

The Internet has facilitated 
communication with our 
colleagues elsewhere — but the 
availability of journals on the 
Internet has also made plagiarism 
easier. This widely acknowledged 
problem affects the scientific 
community worldwide. Iran, sadly, 
is no exception, and the country's 
science community is overhauling 
its practices to counter this 
scourge. 

But circumstances more 
specific to Iran are also conducive
to the spread of plagiarism.
Iranian culture places an
excessive emphasis on the
value of academic credentials,
both for advancement in official
professions and in social standing.
In particular, Iran’s political
class has an unusual affinity for 
possessing academic distinctions, 
as exemplified by the fact that a 
university degree is a prerequisite 
for election to parliament. A 
higher degree is also considered 
an important qualification for 
holding other government offices. 
As a result, the Iranian political
class, across the political and 
ideological spectrum, accounts 


for a disproportionate share of 
academic fraud. 

Farhad Ardalan, Hessamaddin Arfaei, 
Reza Mansouri Sharif University 

of Technology and the Institute for 
Research in Fundamental Sciences 
(IPM), Tehran, Iran 

Mahdi Balalimood Mashhad University 
of Medical Sciences, Mashhad, Iran 
Dariush Farhud, Reza Malekzadeh 
Medical University of Tehran, Iran 
Habib Firouzabadi, Keramatollah 
Izadpanah-Jahromi, Afsaneh Safavi
Shiraz University, Shiraz, Iran

Ali Kaveh Iran University of Science 
and Technology, Tehran, Iran 

Farrokh Saidi, Abbas Shafiee 

Shahid Beheshti Medical University, 
Tehran, Iran 

Yousef Sobouti Institute for Advanced 
Studies in Basic Sciences, Zanjan, Iran 


Opening dialogue 
between the recent 
and the long ago 


Douglas Erwin's call for 
palaeontologists to move towards 
a better understanding of 
diversity (Nature 462, 282-283; 
2009) should be extended. 
Palaeoecologists should move 
beyond purely descriptive 
objectives and towards a better 
understanding of ecosystem 
evolution. 

The worlds of palaeoecologists 
and ecologists are very much 
apart, and dialogue between 
the two remains muted. (One 
exception is the Quaternary, in 
which several key researchers 
have had a strong ecological 
background.) 

Palaeoecologists working 
across geological time should, we 
suggest, familiarize themselves 
with the paradigms being debated 
in the ecological literature, and 
seek ways with which they 
could be examined in the fossil 
record. Palaeoecological research 
has to rely on uniformitarian 
assumptions using modern 
ecological analogues. Similarly, 
ecologists could be more 
sensitive to the limitations of 
data collection from the historical 
record, and could help frame 


modern studies to complement 
palaeoecological hypothesis 
testing. 

The way modern and fossil 
ecosystems are described will 
necessarily be different, but 
researchers should strive to see 
whether the rules described for 
modern ecosystems hold over 
geological time. After all, the fossil 
record provides the only way to 
study changes in ecosystems over 
more than centennial timescales, 
along with samples of non- 
analogous (‘extinct’) ecosystems 
and habitats. 

As first steps, palaeoecologists 
could publish in leading ecological 
journals and participate in 
ecological conferences. 

Julien Louys, Laura C. Bishop, David 
M. Wilkinson Research Centre in 
Evolutionary Anthropology and 
Palaeoecology, School of Natural 
Sciences and Psychology, 

Liverpool John Moores University, 
Liverpool L3 3AF, UK 

e-mail: j.louys@ljmu.ac.uk


UK defence group's 
structure could limit 
its usefulness 


You describe a new UK Ministry 
of Defence programme, in a News
story (Nature 462, 151; 2009), 
as being modelled on its US 
counterpart, the JASONs: this is 
an independent group of scientific 
advisers with high-level security 
clearance who have consulted for 
the US government on technical 
problems since the cold war. As 
a member of the JASONs for
more than a decade, I note that
several organizational aspects are 
crucially different. 

First, the British government 
intends to finance research in 
the laboratories of members 
of their advisory group in order 
to follow up on ideas that they 
generate. Second, most members 
won't have security clearance. 
Third, members will not be self- 
selected, but will be appointed 
by the government. Fourth, the 
membership will be rotated 
frequently according to topic. 
In my opinion, these differences
detract from the usefulness 
of the UK enterprise. Lack 
of clearance discourages 
members from bringing up 
relevant issues, and will hobble 
security-related work generally. 
Controversial conclusions of 
reports could be dismissed by 
government officials with the 
all-too-familiar refrain: “You 
wouldn't advise that if you 
knew what we know.” 

Also, the group stands to 
forfeit independence because 
the government will select 
scientists according to its own 
criteria, which tend to include 
political expediency in addition 
to competence. They risk losing 
credibility within the scientific 
community for the same reason. 
When members are funded to 
pursue projects at their home 
institutions, it raises conflict- 
of-interest issues. And rotating 
the membership will prevent the 
organization from ever developing 
a true ‘corporate memory’. 

As an organization, the JASONs 
have earned credibility on a wide 
range of security-related topics 
over the years. The long-standing 
working relationships forged 
among the JASON members 
rank among its most significant 
strengths. The lack of direct 
professional ties to government 
sponsors fosters impartiality. The 
collective experience gained from 
working on so many different 
kinds of problem, combined with 
the individual credentials of its 
scientifically diverse membership, 
make the JASONs one of the few 
government advisory groups 
that can plausibly be called 
independent. 

Steven M. Block Departments of 
Biology and Applied Physics, 
Stanford University, Stanford, 
California 94305-5020, USA 
e-mail: sblock@stanford.edu 


Contributions to this page may be
sent to correspondence@nature.com.
Please see the Guide to Authors at
go.nature.com/cMCHhno. We also welcome
comments at Nautilus
(http://blogs.nature.com/nautilus).
Geothermal quake risks must be faced


Discussion needs to be open about how exploitation of Earth's internal heat can produce earthquakes, says 
Domenico Giardini, so that the alternative-energy technology can be properly utilized. 


Geothermal energy is being explored as an attractive
alternative energy source. Conventional
hydrothermal resources, such as hot springs 
in geothermal areas, have been effectively 
exploited in the past century, but their distri- 
bution and potential for supplying electricity 
is somewhat limited. Tapping deep geothermal 
energy offers new prospects. 

An enhanced geothermal system (EGS), 
originally called a ‘hot dry rock’ system,
involves drilling a hole at least 3 kilometres 
deep into a layer of non-porous rock where 
temperatures are higher than 100°C. Fluids 
are pumped under high pressure into the rock 
(a process called stimulation), which induces 
it to fracture, generating micro-earthquakes, 
thereby increasing its permeability and cre- 
ating a reservoir for the fluid. The ruptures 
generate elastic waves that are detectable by 
sensitive seismic networks. Once a reservoir 
of permeable rock larger than a cubic kilometre 
has been formed, additional holes are drilled to 
extract heat from the rock mass by circulating 
fluids through the fracture network. 

The brute-force approach of EGS is 
attractively simple. And it has, theoreti- 
cally, the capacity to generate large amounts 
of alternative energy by tapping a virtually 
unlimited source — the heat stored deep 
inside Earth. An expert panel convened at 
the Massachusetts Institute of Technology in 
Cambridge in 2006 estimated that EGS could 
provide up to 100,000 megawatts of electricity 
in the United States by 2050, or about 10% of 
the current national capacity — a very large 
proportion for an alternative energy source. 
In October, the United States announced that 
up to US$132.9 million from the recovery 
act would be directed at EGS demonstration 
projects, and big names including Google 
have invested in the technology. 

The drawback is that such enhanced geo- 
thermal systems can induce earthquakes. The 
initial stimulation creates micro-earthquakes 
that might be felt at the surface or even pro- 
duce damage. And the pressurized water forced 
into the rock could interact with existing deep 
faults, generating potentially large quakes. The 
probability of this happening is not large, but 
needs to be considered. In addition, geothermal 
energy is more profitable if it generates electric- 
ity and heating at the same time. That means 
Enhanced geothermal systems, such as this planned one in California, must undergo quake risk analysis.


that customers have to be close to the energy 
source, so it is attractive for operators to develop 
geothermal-energy sites in urban areas, where 
earthquakes are more problematic. 

Thousands of deep geothermal sites will 
have to be developed for geothermal energy 
to supply a sizeable component of the global 
energy need. If a significant fraction of these 
induce seismic action under dense urban areas 
that is felt or is damaging, this will exceed the 
natural rate of activity in stable continental 
areas. Man-made rather than natural earth- 
quakes are already the dominant component 
of seismicity in mining districts in countries 
such as Poland and the Czech Republic, but is 
society across Europe and elsewhere ready to 
accept this threat in urban areas? 

In a recent case in California, a planned 
EGS site at the Geysers, a geothermal power 
field about 100 kilometres north of San Fran- 
cisco, met with public resistance and fell under 
review by the Department of Energy (even 
though the company involved had completed 
an appropriate seismicity review). In Septem- 
ber, that project was suspended because of 
technical difficulties. 

For an enhanced geothermal system located 
near a city or in an area already hit by past 
large earthquakes, the increased seismic risk
requires developing mitigation strategies, such 
as restricting the pressure or location of pumped 
fluids. Open and comprehensive information 
and education needs to be provided to the pub- 
lic and to authorities before, during and after 
the project. The risks must be openly recog- 
nized and assessed, and thought needs to be 
given to how to insure against damage caused 
by the projects. Discussion is needed with all 
stakeholders — including scientists, politicians 
and the public — to decide what level of risk 
is acceptable. Otherwise society risks a pub- 
lic backlash that could unnecessarily quash a 
promising alternative-energy technology. 


The Basel story 

One of the first purely commercially oriented 
EGS projects — the Deep Heat Mining project 
— was initiated in Basel, Switzerland, in 1996 
by the Geopower Basel (GPB) consortium. In 
my view, what started as a promising green- 
energy initiative turned into a messy affair. It 
is a textbook example of how the failure to 
come to terms fully with the possibility of 
producing earthquakes in an urban area (by 
everyone involved — including the public) 
became in itself the largest risk to the whole 
concept of geothermal exploitation. We can
learn important lessons from the case, which 
should serve in securing a long-term future to 
this promising energy source. 

Basel, an industrial centre of Europe’s 
chemical and pharmaceutical industry, bor- 
ders France and Germany, and more than 
700,000 people live in the area. It has a history 
of earthquakes; in 1356, the city was severely 
damaged by a magnitude-6.7 quake, the largest 
ever recorded in central Europe. 

Preparing for a commercial EGS project in an 
industrial zone took several years. In October 
2006, the injection well reached its final depth 
of 5 kilometres, and was ready for the injection 
of high-pressure fluids into the granite. A moni- 
toring system was installed, with six borehole 
seismometers installed near the injection well 
and up to 30 seismic surface stations in the 
Basel area, and a contingency shutdown plan in 
case of felt earthquakes. Nevertheless, the Swiss 
Seismological Service, which had no regulatory 
power in this case, communicated to GPB and 
the Basel authorities that the service had not 
seen what it would consider an adequate seis- 
mic risk analysis for the project. 

The local authority confirms that GPB had a 
valid permit, and had met all that permit’s condi- 
tions. On 2 December 2006, GPB began inject- 
ing water into the well with increasing flow rates. 
As expected, thousands of micro-earthquakes 
were recorded. Because of the strongly increased 
seismic activity felt at the surface, injection was 
stopped on 7 December. A few hours later, a 
magnitude-3.4 event rattled the local popula- 
tion, causing fear and anger, and receiving inter- 
national media attention. In a press release on 
9 December, GPB announced its regret for the 
incident, saying the tremors produced by the 
project were larger than expected. Slight
non-structural damage, such as fine cracks in plaster,
was claimed by many homeowners and paid by 
GPB’s insurance. The incident also led to a court 
case against an individual — not GPB — that 
starts this week. 

Since the water injection stopped, seismicity in the
area has slowly decayed. Three years later, sporadic
seismicity inside the stimulated rock volume is still
being detected by the down-hole instruments.

This EGS project has been
on hold, awaiting the completion of an independent
risk analysis by a consortium of
seismologists and engineers, selected by state
authorities following an international bid. The
study was released on 10 December this year,
and public authorities have now decided to
suspend the project.

Sweeping up: a geothermal project in Basel,
Switzerland, has been suspended.

There have been several other forays into
enhanced geothermal energy projects in
Europe, some of which have been associated
with earthquakes. The European Hot Dry Rock
geothermal-energy project in nearby
Soultz-sous-Forêts, France, has been developed to
a depth of 5 kilometres over the past decade.
During stimulation, seismicity was generated
there with a maximum local magnitude of 2.9.
The plant was adapted to reduce the earthquake
risk, and is scheduled to begin producing
electricity in January 2010. At 2 megawatts, it will
be the largest commercial EGS site in operation.
Felt earthquakes are also occasionally
associated with natural geothermal systems. In
Landau, Germany, a 3-kilometre-deep system
was constructed in naturally permeable layers,
and earthquakes were not expected. However,
seismicity was felt a year after the start
of energy production, in 2007, and suspended
operations for many months. In both of these
cases the geothermal exploitation is carried out
in more-rural areas without a known history of
large earthquakes.


Realistic approach

The risk of overreaction to the risks inherent
in deep geothermal projects is very real. The
establishment of an overly harsh regulatory
framework would penalize the geothermal
industry in comparison to
other energy sectors that carry
a recognized risk of inducing
seismicity, such as gas extraction
or coal mining.

From their outset, EGS
projects need to be thought
of both as pilot projects with
scientific unknowns and
as commercial ventures with technological
and financial risks. Companies need to have
allocated enough of their budget to scientific
investigations not directly related to the
exploitation of heat. Local authorities need to
avoid being enticed by the promises of alternative
energy, and to remember to ask the right
questions. Risk evaluations need to be done
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before — not after — these projects begin. 

Even if the right questions are asked at the 
right time, the scientific and engineering com- 
munity is hard pressed to provide a consensus 
opinion on how seismic hazards can be assessed
with confidence and minimized. The empirical
data include only a handful of well-monitored
EGS experiments; models are consequently
poorly constrained. The European Commission
has approved the Geothermal Engineering
Integrating Mitigation of Induced Seismicity
in Reservoirs (GEISER) project to improve 
the knowledge base and suggest procedures 
and regulations for the future exploitation of 
deep geothermal energy. However, many EGS 
projects are expected to open in the years before 
the GEISER project produces useful results. 

The Basel programme is likely to have a 
strong effect on the insurance cost of future 
projects associated with induced seismicity. 
The damage claims in Basel amounted to more 
than $9 million, which seems a high toll for 
a local magnitude-3.4 event (although this is 
hard to say definitively, because data on small 
non-structural damages from past earthquakes 
have never been comprehensively collected). 
The damage in each building never reached 
the 10% property level that is normally applied 
as deductible by home insurance policies. For 
a natural event, the damage would have been 
covered by the homeowners, but for a man- 
made event, the whole cost was picked up by 
the company’s liability insurance. This of course 
opens a difficult issue. How would we treat a 
magnitude-5.5 earthquake hitting Basel in, say, 
30 years? Could we prove whether it was natural 
or not? Who would cover the damage? 

The public reacts with a vengeance if it 
perceives that a known problem has been 
hidden. More than this, earthquakes invari- 
ably raise primordial fears. Waking up the 
sleeping terror that lurks in the deep is the 
plot of numerous horror movies; here it has 
an all-too-real meaning. 

It is now becoming clear to the public, local 
authorities, the geothermal industry and regu- 
latory agencies that deep geothermal systems 
carry a small risk — as do most technologies 
in the energy sector. Dams can break, nuclear 
power plants may fail, carbon dioxide released 
from oil and gas contributes to global
warming, and EGS projects can create damage 
through induced earthquakes. The open ques- 
tion is whether or not society is able to find 
ways to balance and accept these risks. A
well-informed discussion is needed to find out.
Domenico Giardini is director of the 
Swiss Seismological Service, ETH Zurich, 
Sonneggstrasse 5, CH-8092 Zurich, Switzerland. 
e-mail: giardini@sed.ethz.ch 
See go.nature.com/rk5jgK for further reading.

A vision of the nanoscale


A collaborative effort between a photographer and a chemist could show scientists how to make 
the small scale more intuitive, says Jeremy Baumberg. 


No Small Matter: Science on the Nanoscale 
by Felice C. Frankel and George M. Whitesides

Belknap Press (Harvard University Press): 
2009. 192 pp. $35, £25.95, €31.50 


On the bookshelves of my childhood, I 
remember science encyclopaedias that were 
filled with images of natural and synthesized 
wonders. No modern shelf should be without 
titles that chart today’s exploration of science’s 
outer limits. But whereas hordes of glossy
books have been published on astronomy, for 
example, the small world has been shyer to 
emerge. Perhaps this is partly because nano- 
science, like postmodern art, seems to demand 
a lengthy explanation of what we are view- 
ing — but at the expense of an unmediated 
experience. 

Reorienting our eye to the nanoscale is 
No Small Matter. This coffee-table book juxta- 
poses images and ideas to encapsulate the sig- 
nificance of size and shape. It is the product of 
a second collaboration between science pho- 
tographer Felice Frankel and chemist George 
Whitesides, and follows their project to capture 
images of pattern formation on surfaces (On 
the Surface of Things; Harvard University Press, 
1997). Exploring where art meets science, the 
authors search for promising paths to make 
small-scale science more intuitive. 

Science-art collaborations frequently end 
up as one-way processes, with science donat- 
ing the metaphors to a consumerist arts 
culture that fashions them into a new end 
product. Frankel avoids this trap by using 
images — photographs on a human scale and 
through microscopy — to frame graphic icons 
that encapsulate a theme and draw in the 
viewer. As with much of art, the explanatory 
power of these pared-down representations 
relies on the previous experiences of the per- 
son who enquires of it, so the collection is a 
hit-and-miss affair. For instance, my response 
to their photo of an abacus that introduces 
the theme of counting in binary — clicking, 
serried ranks — might be very different from 
yours. 

Partnered with each photograph is a short,
blog-like column. The text informs while the
image reinforces the text’s metaphors, each
pair offering a self-contained journey. The
book’s themes, conveyed in textbook-like
sequence by bold numbering, include structural
and conceptual underpinnings at the
small scale; life and engineering; and
nanotechnology's potential risks and prospects. It is
a highly personal tour of science that benefits
from the clear intuition born from Whitesides’
experience. Such thoughtful tactics epitomize
the difficulty of making the revelations of
nanoscience accessible.

This glass apple's curious shadow (part round, part cube) symbolizes the duality of the quantum world.

The weaving of an eminent nanoscientist 
and an inspired photographer creates a some- 
times inconsistent cloth. Some images inspire 
awe, such as the delicate cage of the marine 
sponge known as Venus’ flower basket, or the 
grasping arrays of polymer rods described as 
nano-fingers. Others lack depth, such as the 
ubiquitous candle flame. And yet others don't 
quite illuminate, such as a depiction of the 
quantum world through an artificially rendered 
cubical shadow of a spherical apple. Forcing 
such visual simplicity leads to metaphors that 
hang flapping — for example, a full or empty 
wine glass caricatures the digital nature of a 
binary bit, but it doesn't help to explain the 
elegant binary logic of 1+1=0. Nonetheless, 
the authors’ engagement is stimulating. 

The book does convey the idea that shape 
and size matter hugely to nanoscale function. 
It captures well the mystery of how the layering
of these simple concepts generates the
complexity of life. But the authors’ attempts
to justify the funding and ambition of nano- 
science are awkwardly conflated with societal 
challenges, such as clean-energy generation. 
Although there are connections, they don't 
emerge simply from the preceding material, 
leaving the reader bewildered. 

The big-science communities of particle 
physics, genomics and astronomy are highly 
organized when disseminating knowledge 
and lobbying on behalf of their fields. Despite 
its much greater number of practitioners, the 
nanoscience crowd has not spread its mes- 
sages across wider society. Using images such 
as these to build people’s familiarity with 
nanoscience can provide a visual shortcut 
that connects emerging stories with a broader 
message. 

Frankel and Whitesides’ book adds gravi- 
tas and nuance to the popularization of 
nanotechnology, articulating its interest and 
vast opportunities. Rather than being stuck 
on a teenager’s bookshelf, No Small Matter 
should lie open on our coffee tables, inviting 
comment.
Jeremy Baumberg is professor of nanophotonics 
and director of the Nano Doctoral Training 
Centre at the Cavendish Laboratory, University of 
Cambridge, Cambridge CB3 0HE, UK.
e-mail: j.j.baumberg@phy.cam.ac.uk

Pop-up physics


A Voyage to the Heart of Matter: The ATLAS Experiment at CERN

by Emma Saunders with Anton Radevsky 
Papadakis: 2009. 8 pp. £20 


Finally up and running after a 14-month 
repair job, the Large Hadron Collider (LHC) 
at CERN, Europe’s particle-physics laboratory
near Geneva, Switzerland, is the most complex 
experiment in the world and took thousands of 
physicists well over a decade to assemble. So it is 
perhaps to inspire empathy that an LHC pop-up 
book asks its readers to spend a few minutes 
fumbling with pieces of paper that (eventually) 
fold into a model of the giant ATLAS detector, 
one of four detectors at the collider. 

This feature and others do not necessarily 
mean that A Voyage to the Heart of Matter is 
aimed at young children; indeed, its author, 
Emma Saunders, admits that she would not let 
her three-year-old son flip through it unsupervised.
But it is perfectly suited for any adult
with even a passing interest in the LHC and a
desire to try their hand at detector assembly.

Detector origami: the pop-up model of ATLAS.

The book manages to pack an incredible 
amount of information about the collider into 
just four spreads. The first guides readers 
through the layout of the collider and its design; 
the second and third describe the ATLAS detec- 
tor in detail; and the final spread, the book's 
most beautiful, shows galaxies in flight around 
a swirling maelstrom of primordial particles 
similar to those being hunted by the LHC
physicists. In addition to a main pop-up, each
page contains four flaps that open to reveal 
further facts about the collider and its goals. 

The book’s greatest charm is its obsession 
with detail. There is no reason one would have 
to include an ‘inner detector’ in the ATLAS
mock-up; yet physicists would surely object 
if it was not there. Similarly, the fountain- 
like recreation of the Big Bang, in addition to 
being beautiful, is highly accurate. At its base 
— corresponding to The Beginning — lie 
unconfined quarks, while the upper layers cor- 
respond to the cosmic microwave background, 
the beginnings of large-scale structure, and the 
eventual formation of galaxies. 

Most important, the pop-up format actually 
improves the book. Rather than being gim- 
micky, each pop-out genuinely illuminates the 
workings of the detector and the interactions of 
the particles it hopes to find. The book would be 
the perfect holiday gift for any armchair physicist
who wants a little taste of life at the LHC.
Geoff Brumfiel is a senior reporter for Nature. 


Trust puts the self on show 


Identity: Eight Rooms, Nine Lives 
Wellcome Collection, London 
Until 6 April 2010 


A maze of corridors winds through featureless 
partition walls, adorned only by mirrors rang- 
ing from the exotic (Etruscan artefacts from the 
collection of Sigmund Freud) and the surreal 
(a digital ‘time-lapse’ mirror in which one’s past 
movements materialize belatedly, like a ghost) to 
the banal (a cheap, plastic shaving mirror once 
belonging to actor Michael York). The corridors 
feed into eight rooms, each exploring an aspect 
of the knotty question of who we are. 

Identity launches a series of national events 
organized by UK science-funding body the 
Wellcome Trust. At this showing, Ken Arnold, 
head of public programmes at the Wellcome 
Collection in London, announced that the 
upcoming tenth anniversary of the draft human 
genome seemed a timely moment to step back 
with a mix of medicine, popular culture and 
high art, designed to get at a “dauntingly broad” 
question. “We quickly found that philosophy 
and neuroscience really didn’t have answers 
to some very basic questions about identity,” 
said one of the co-curators, Hugh Aldersey-Williams.
“Meanwhile, biology, society and law
find themselves increasingly in conflict over
these basic issues.” Their strategy in tackling 
the vast subject was to choose eight topics, each 
introduced by a figurehead personality. 

One of these is geneticist Alec Jeffreys. Behind 
glass are artefacts from his childhood: a well- 
loved copy of Biggles Works It Out; a school 
report displaying all A grades (including 19.5 out 
of 20 in science), apart from a B in writing and a
C+ in physical training and games (“tries hard”). 
In another case, the trappings of Jeffreys’ scientific
life are arrayed like ancient relics: a battered
Geiger counter, an X-ray film with scattered 
black bands; a pivotal Nature paper (Nature 
317, 818-819; 1985). Among these everyday 
tangibles, Jeffreys’s eureka moment is mapped 
with remarkable precision: “At 9 a.m. on Monday
15 September 1985, he found what he was
looking for,” says the placard. “By afternoon he
had coined the phrase ‘DNA fingerprinting’.”

Francis Galton and Alec Jeffreys (centre) both pioneered fingerprinting: the first using ink, the latter, DNA.

Unlike the other isolated rooms, Jeffreys’s is
fused with that of Francis Galton, the Victorian
polymath who pioneered fingerprinting as a
means of identification. Aldersey-Williams confesses
to an attempt to draw “cheeky parallels”
between the trivialities of the two men. Thus,
the childhood memorabilia of Galton includes
a worn copy of the Iliad and a letter to a relative, 
in which the four-year-old brags about being 
able to read “any English book” and recite 
“all the Latin substantives”. Galton’s slant on 
identity was all about phenotype: on display 
are the composite photographs with which he 
tried to distil the facial essence of particular 
groups: criminals, asylum patients, Baptist 
ministers — even scientists. Further parallels 
are introduced in the room devoted to physio- 
logist Franz Joseph Gall, which is full of the 
trappings of phrenology — moulds of crania 
and masks of faces, including those of Voltaire 
and Isaac Newton. These are juxtaposed with 
videos of scans using magnetic resonance 
imaging, which show areas of the brain light- 
ing up during difficult decision-making or jazz 
improvisation — the sort of modern phrenol- 
ogy of which Gall might have approved. 
Other rooms explore the perceptions of self 
from a less scientific vantage. One displays 
diaries, including that of Samuel Pepys and 
of Clive Wearing, a pianist with anterograde 
amnesia whose inability to form new memories 
means that every line reflects perfect imme- 
diacy, as if recording the first moment of his 
life. Visitors are invited to sit down in the actual 
Big Brother chair and mingle with traces of D-list 
celebrity sweat. There is a room about twins. In 
it, two identical twins struggle to distinguish 
themselves as they age, but the series of photos 
culminates in an adult pose that is uncon- 
sciously mirror-image: the result, we are told, 
of an egg that split early on in development. 
The genetics of gender is addressed in a 
room about April Ashley, one of the first people 
in Britain to have a complete sex-change opera- 
tion. A succession of press clippings charts not 
the transformation of someone from male to 
female, but rather the relentless battle of some- 
one who always thought she was a woman. 
This paper trail ends in a retrospective change
of gender on Ashley’s birth certificate — a 
reinventing of biological history that might 
feel vaguely uncomfortable to the average visi- 
tor. Yet the recent public outrage on behalf of 
Caster Semenya, the world-champion middle- 
distance runner whose gender is still in dispute, 
shows that many of us are prepared to define 
sex by means other than strictly genetic. 
Co-curator James Peto admits that, at the 
beginning, he thought covering a topic such 
as identity was completely overwhelming. “You 
can only scratch away at little bits and hope- 
fully raise enough questions for people to start 
finding answers for themselves.”
Jennifer Rohn is a cell biologist at University 
College London and editor of LabLit.com. Her first 
novel is Experimental Heart. 
e-mail: jenny@lablit.com

Mona Hatoum's Hot Spot globe.


Artistic dispatches on climate 


Earth: Art of a Changing World 
Royal Academy of Arts, London 
Until 31 January 2010 


Photographs of our blue planet, taken 
during the 1968 Apollo 8 lunar mission, 
transformed our grasp of its fragile equilib- 
rium. In 2009, we need similarly defining 
images to galvanize interventions to mitigate 
climate change. Aiming to provide just that 
is the exhibition Earth: Art of a Changing 
World, organized by the Royal Academy of 
Arts in London in collaboration with Cape 
Farewell, a charity that encourages artists to 
engage with the science of climate change. 
It examines how the global-warming debate 
has influenced the practice of more than 
30 international contemporary artists, some 
of whom have participated in Cape Farewell 
expeditions to the Arctic. 

On entering the exhibition — aptly on 
show in the building formerly occupied by 
the now defunct Museum of Mankind — 
visitors are confronted by the UK sculptor 
Antony Gormley’s Amazonian Field (1992), 
which comprises 15,000 fired clay figures 
with questioning expressions. “I wanted to 
make a work about our collective future and 
our responsibility for it,” Gormley has said
of his sculpture, hand-modelled by people 
living in the Amazon basin. In contrast to 
this cooperative work, the Palestinian sculptor
Mona Hatoum’s Hot Spot (2006) alludes
to human conflicts at contested borders.
A skeletal globe is tilted at the same angle as
Earth, its stainless steel latitudes and longitudes 
supporting continents outlined in glowing neon 
tubes. Hot Spot hints at the increasing global 
unrest that could be caused by water shortages 
resulting from climate change. 

Also on show is the work of the UK artists 
Heather Ackroyd and Dan Harvey, participants 
in several Cape Farewell expeditions. As 
part of their ongoing project, Beuys’ Acorns 
(started in 2007), they have planted young 
oak saplings — grown from acorns collected 
from trees planted by the pioneering German 
ecological artist Joseph Beuys in 1982 — on the 
portico outside the exhibition. Growing more 
trees, rather than felling the world’s forests, is 
their symbolic and optimistic act in the face of 
justifiable pessimism about climate change. 

“We have come to this ship in a frozen fjord 
to think about the ways we might communi- 
cate our concerns about climate change to a 
wider public,” said UK writer Ian McEwan of 
the purpose of his and other artists’ 2005 Cape 
Farewell expedition. His latest novel, to be 
published next year, deals with climate change. 
For now, his text The Hot Breath of Our Civilisation
(2005) is exhibited on the gallery wall
as a scrolling display. It provides an epilogue to 
the exhibition: “Are we at the beginning of an 
unprecedented era of international coopera- 
tion, or are we living in an Edwardian summer 
of reckless denial? Is this the beginning, or the 
beginning of the end?”
Colin Martin is a writer based in London. 
e-mail: cmpubrel@aol.com

EXTRASOLAR PLANETS


Water world larger than Earth 


Geoffrey Marcy 


The hunt for Earth-like worlds has taken a major step forward with the discovery of a planet only 2.7 times 
larger than Earth. Its mass and size are just as theorists would expect for a water-rich super-Earth. 


Momentous breakthroughs in science often 
come unexpectedly and serendipitously, 
requiring decades of patience. Only rarely 
does a long-sought scientific frontier loom so 
prominently just beyond the horizon that the 
next generation of instruments seems sure to 
reach it. A tantalizing case for such a breakthrough
is presented by Charbonneau et al.¹
on page 891 of this issue. They provide the 
most watertight evidence so far for a planet 
that is something like our own Earth, outside 
our Solar System. 

Charbonneau and his co-workers developed 
a simple and forward-looking planet-hunting 
technique. They installed a suite of eight 
amateur-sized telescopes (with 40-cm-diam- 
eter mirrors), each with a sensitive charge- 
coupled-device light detector that measures the 
near-infrared brightness (wavelengths of about 
700–900 nanometres) of a star. Any star whose
brightness dims for about an hour, and repeats 
that dimming like clockwork over the course of 
days and weeks, is probably doing so because 
an orbiting planet is crossing briefly in front 
of it, blocking a fraction of the star’s light. The 
amount of dimming directly indicates the size 
of the planet relative to that of the star. From 
a large sample of nearby stars², Charbonneau
et al. have smartly chosen the 2,000 with the smallest
radii, so that near-Earth-sized planets would
block at least 1% of a star’s light, rendering such
worlds detectable. 
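
The link between dimming and planet size described above can be sketched in a few lines. This is a minimal illustration that assumes the simplest geometric picture, in which the fractional dimming equals the ratio of the planet's disk area to the star's disk area; it ignores limb darkening and grazing transits. The function names, the rounded radius constants and the choice of a star with one-fifth of the Sun's radius are illustrative assumptions, not values taken from Charbonneau et al.

```python
import math

# Rounded radii in kilometres (illustrative constants, not from the article).
R_SUN_KM = 695_700.0
R_EARTH_KM = 6_371.0


def transit_depth(planet_radius_km: float, star_radius_km: float) -> float:
    """Fractional dimming when the planet's disk covers part of the star's disk."""
    return (planet_radius_km / star_radius_km) ** 2


def radius_ratio_from_depth(depth: float) -> float:
    """Invert the relation: planet radius as a fraction of the star's radius."""
    return math.sqrt(depth)


# An Earth-sized planet crossing a Sun-sized star.
depth_sun = transit_depth(R_EARTH_KM, R_SUN_KM)

# The same planet crossing a hypothetical star with one-fifth of the Sun's radius.
depth_small_star = transit_depth(R_EARTH_KM, 0.2 * R_SUN_KM)

print(f"Sun-sized star: {depth_sun:.2e}")
print(f"Small star:     {depth_small_star:.2e}")
print(f"Gain from the smaller star: {depth_small_star / depth_sun:.0f}x")
```

Because the depth scales with the inverse square of the stellar radius, a star five times smaller than the Sun gives a signal 25 times deeper for the same planet, which is the reasoning behind surveying the smallest stars.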

Figure 1 | A water-rich super-Earth? The newly discovered¹ extrasolar
planet, GJ 1214b, probably contains a huge amount of water, surrounding
an inner core of iron and nickel, and an outer mantle of silicate rock,
and may have a small atmosphere of hydrogen and helium. Only 2.7 times
larger than Earth, and just 13 parsecs away, this super-Earth brings
astronomers closer to discovering Earth-like planets.

Charbonneau’s team¹ has found that the small, faint star GJ 1214
undergoes repeated dimming of 1.3% for 52 minutes every 1.6 days.
The only plausible interpretation is that a planet orbits the star
with an orbital period of 1.6 days and that it has a radius that is
12% that of the star. Good estimates of the star’s radius (21% that of
the Sun) put the planet’s radius at only 2.7 Earth radii. Such a small
planet orbiting a star other than the Sun is an extraordinary find.
With the tools currently available, only one other extrasolar planet
has been reported that is thought to be close in size to Earth, namely
CoRoT-7b, at 1.7 Earth radii. The new planet, which is only about
13 parsecs away, is named GJ 1214b. Importantly, it pulls
gravitationally on its host star, causing the star to move with a
speed of 12 m s⁻¹, which the team has detected through measurements of
wavelength shifts in the star’s light (the Doppler effect). The
planet’s inferred mass is a mere 6.6 Earth masses, which, when combined
with its radius, leads to a density of 1.9 g cm⁻³. By contrast, Earth’s
average density is much higher, at 5.5 g cm⁻³. Because water has a low
density of about 1 g cm⁻³, the chemical composition of the new planet
is probably some admixture of rock and water, with perhaps a small
atmosphere of hydrogen and helium.
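
The radius and density quoted above can be roughly reproduced from the rounded numbers in the text. The sketch below is a back-of-envelope check, not the authors' analysis: it assumes a uniform sphere, takes Earth's mean density of 5.5 g cm⁻³ from the text, and uses a rounded Sun-to-Earth radius ratio of 109, which is an assumption of this sketch. The function name is illustrative.

```python
import math

EARTH_DENSITY_G_CM3 = 5.5          # Earth's mean density as quoted in the text
SUN_RADIUS_IN_EARTH_RADII = 109.0  # rounded value; an assumption of this sketch


def density_from_earth_units(mass_earths: float, radius_earths: float) -> float:
    """Mean density in g/cm^3 for a sphere with mass and radius in Earth units.

    Density scales as mass / radius^3, so the result is Earth's density
    multiplied by that ratio.
    """
    return EARTH_DENSITY_G_CM3 * mass_earths / radius_earths ** 3


# Radius: a 1.3% dimming implies a planet/star radius ratio of sqrt(0.013),
# applied to a star with 21% of the Sun's radius.
radius_ratio = math.sqrt(0.013)
planet_radius = radius_ratio * 0.21 * SUN_RADIUS_IN_EARTH_RADII

# Density: the quoted 6.6 Earth masses and 2.7 Earth radii.
planet_density = density_from_earth_units(6.6, 2.7)

print(f"planet/star radius ratio: {radius_ratio:.3f}")
print(f"planet radius: {planet_radius:.1f} Earth radii")
print(f"planet density: {planet_density:.1f} g/cm^3")
```

With these rounded inputs the estimate comes out slightly below the quoted 2.7 Earth radii and 1.9 g cm⁻³, which is to be expected given the rounding and the effects this simple geometry leaves out.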

Could this planet have a solid surface suitable 
for hosting organic-rich ponds and lakes? Some 
astronomical background offers a good guess. 
The protoplanetary disks of dust and gas swirling
around young stars are the sites of planet
formation. The disks are made up of the same
admixtures of H and He gas, carbon, nitrogen 
and oxygen compounds, and iron and nickel 
metals found in nearly all of the stars in our 
Galaxy, including the Sun. Solid dust particles 
made of Fe, Ni, silicates and ices stick together 
and grow into ever larger planetesimals, 
forming the basic cores of all planets. 
The relative amounts of these solid 
constituents vary only modestly 
among different protoplanetary disks, 
for two reasons. First, the abundances 
of the atomic elements are nearly the 
same, within factors of two, from star 
to star, with C, N, O, silicon, mag- 
nesium and Fe being the building 
blocks of the solid material. Second, 
the highly negative Gibbs free energy of 
carbon monoxide and silicates locks up 
as much oxygen as the limiting reagents
(C and Si) permit, leaving plenty of oxygen
to form water ice, despite its higher Gibbs free 
energy. Thus, silicates and water ice dominate 
the mass budget of the solid material in the 
cold regions of protoplanetary disks, along 
with the Fe and Ni dust grains. 

That solid material forms the building blocks 
of large planets such as Saturn and Neptune, 
and perhaps smaller planets as well, such as the 
new one¹. But the density of 1.9 g cm⁻³ for this
new planet imposes a constraint on the rela- 
tive amounts of each constituent. To keep the 
planet's density that low requires that it contains 
large amounts of water. If the planet were pure 
Fe and silicates, its density would be similar to 
Earth’s. It must contain a huge amount of water,
roughly 50% by mass. 

The wild card is the amount of H and He 
gas in the atmosphere. Spooning additional H 
and He (of low density) onto a planet makes 
its density lower, which can be compensated 
for by increasing the amount of Fe in the core 
to bring the overall density to that measured, 
1.9 g cm⁻³. But the planet-building environment
is unlikely to spawn planets composed
of mostly Fe and H/He, but very little water. 
Any planet that contains Fe, rock and H/He 
would have also retained correspondingly large 
amounts of water. Thus, it is likely that this
new world has nearly 50% of its mass in water
surrounding an Fe/Ni core and a silicate mantle 
(Fig. 1). It probably has an extraordinarily deep 
ocean, which would be liquid given its equi- 
librium surface temperature of some 190°C 
due to heating from the host star. A sauna-like 
steam atmosphere is possible, with slow photo- 
lytic and hydrodynamic loss of that atmosphere 
caused by ultraviolet-light irradiation. A thin 
H/He outer atmosphere is also possible. 

And so comes the profound anthropocentric 
question. If this planet is 50% water, is it really 
kin of our Earth? Or did it form in a manner 
similar to that of Saturn or Neptune, with a 
rocky core that acquired large amounts of ices 
and gas gravitationally? By contrast, Earth has 
only 0.06% water, and very little H and He gas, 
having formed in a dry environment. This new
planet is close to Earth in size, but perhaps not 
next of kin. 

Nonetheless, Charbonneau’s team¹ has highlighted
a promising future for the discovery of
Earth-like worlds. Their efforts are just beginning,
with smaller and rockier planets yet to be
found. Meanwhile, precise Doppler measurements
may reveal the gravitational wobbles of
stars caused by Earth-like planets in tight orbits. 
Most promising is NASA’s Kepler mission. 
Launched in March 2009, Kepler is monitor- 
ing 100,000 stars and is able to detect dimming 
as small as one part in ten thousand of their 
normal brightness, rendering truly Earth-sized 
planets easily detectable. And someday, great 
space-borne interferometers (such as NASA’s 
Space Interferometry Mission) and enormous 
cameras will be launched, able to detect, image 
and spectroscopically analyse the landscapes, 
oceans and atmospheres of nearby rocky 
planets. These techniques will surely answer 
the question Aristotle, Epicurus and Demo- 
critus posed 2,400 years ago regarding Earth’s 
unique status in the Universe.
Geoffrey Marcy is in the Department of 
Astronomy, University of California at Berkeley, 
Berkeley, California 94720, USA. 

e-mail: gmarcy@berkeley.edu

DNA REPLICATION


Prime-time looping 


Nicholas E. Dixon 


When the replication machinery copies DNA, it must unwind the double 
helix in one direction while synthesis of one of the strands proceeds in the 
other. Making transient DNA loops may solve this directional dilemma. 


If you are a cell about to divide, you will first 
need to use a multi-protein machine called a 
replisome to simultaneously make copies of 
both strands of your chromosomal DNA so 
that one strand can be passed to each daugh- 
ter cell. Replisomes have long been thought to 
couple synthesis of both DNA strands by form- 
ing a ‘trombone loop’ of DNA that expands 
and relaxes as synthesis takes place discontinu- 
ously on one of the strands. Two papers, one 
by Pandey et al.¹ on page 940 of this issue and
another by Manosas et al.² published in Nature
Chemical Biology, show that a second type of
loop, called the ‘priming loop’, is transiently
produced in the replisome. 

Figure 1 | DNA replication by a minimal replisome. During DNA
replication by the replisome components, the DNA strands are separated
by the helicase enzyme and replicated by the leading- and
lagging-strand DNA polymerases. As DNA can be copied only in the
5′ to 3′ direction, the polymerase continuously copies the leading
strand, but the lagging strand is made in shorter pieces, or Okazaki
fragments, that are joined together later. DNA synthesis begins by
extending a nucleic-acid primer that is synthesized at priming sites by
the primase enzyme.

The replisome faces special challenges as it makes new DNA at rates
that can approach 1,000 nucleotides per second. Unlike the machines
that make proteins and RNA, which work relatively sluggishly and in a
linear fashion, the replisome must simultaneously copy two strands of
DNA that are aligned in opposite directions (5′ to 3′ and 3′ to 5′).
Replisome chemistry obeys two rules. The first is that a DNA polymerase
(the component of the replisome that synthesizes new DNA from a
template strand) can extend the newly formed DNA chain only in the
5′ to 3′ direction. This means that it can continuously copy only one
of the two DNA strands, called the leading strand. The lagging strand
must be made in shorter pieces that are joined together later. These
pieces, or Okazaki fragments, are a few thousand bases in length and
each is made every few seconds.
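
The timing stated here follows from simple arithmetic on the figures in the text. The sketch below uses round illustrative inputs: a rate of 1,000 nucleotides per second, which the text gives as an upper bound, and a fragment length of 2,000 bases, an assumed value chosen from within "a few thousand". The function names are hypothetical.

```python
def seconds_per_fragment(fragment_length_nt: float, rate_nt_per_s: float) -> float:
    """Time for the fork to advance by one Okazaki fragment's worth of template."""
    return fragment_length_nt / rate_nt_per_s


def primers_needed(template_length_nt: int, fragment_length_nt: int) -> int:
    """Number of lagging-strand fragments, and so primers, rounded up."""
    return -(-template_length_nt // fragment_length_nt)


# Assumed round numbers: 2,000-base fragments at 1,000 nucleotides per second.
interval = seconds_per_fragment(2_000, 1_000)

# Fragments needed to cover one million bases of lagging-strand template.
primers = primers_needed(1_000_000, 2_000)

print(f"one fragment every {interval} s; {primers} fragments per million bases")
```

Slower replisomes or shorter fragments change the numbers but not the scaling: the interval between fragments is the fragment length divided by the fork speed.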

The second rule is that a DNA polymerase 
cannot start a DNA chain — it can only extend 
a pre-existing DNA or RNA chain, called a 
primer. So all cells have a specialized enzyme, 
the primase, that makes the first RNA primer 
for each DNA chain. A new primer must there- 
fore be made every few seconds to be used for 
Okazaki-fragment synthesis on the lagging- 
strand template. This single-stranded template 
DNA is produced by the helicase, a component 
of the replisome that, in bacteria, moves in a
5′ to 3′ direction to separate the two strands of
the double helix (Fig. 1). And herein lies the 
problem — the primase needs to be associated 
with the helicase to function, but the primers 
on the lagging strand are made in the direc- 
tion opposite to the movement of the heli- 
case. Moreover, primer synthesis is relatively 
sluggish, taking about a second or so. 

There are three possible solutions to the 
replisome’s problem. One is for the whole repli- 
some to pause while the primer on the lagging 
strand is made, then to resume its work; such 
pauses have been reported by the van Oijen 
group during primer synthesis by the bacterial
virus (bacteriophage) T7 replisome (Fig. 2a). 
The second solution is for the primase, once 
clamped onto the lagging-strand template 
by the helicase, to be promptly released to 
make its primer at leisure, as happens with the 
Escherichia coli replisome (Fig. 2b).

The third solution is for the replisome to 
continue leading-strand synthesis while the 
helicase-primase complex takes its time to 
make the primer. The helicase continues to 
unwind DNA in the forward direction while 
the physically linked primase makes a primer 
in the opposite direction. This arrangement 
produces a transient single-stranded DNA 
loop in the lagging-strand template, termed the 
priming loop, which is subsequently released 
to become part of the trombone loop when 
the primer is passed to the lagging-strand 
polymerase (Fig. 2c). 
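
A rough sense of scale for the priming loop follows from the figures quoted above. The sketch below is a back-of-envelope estimate rather than a measurement. It assumes, purely for illustration, an Okazaki fragment of 2,000 bases made every 2 to 4 seconds, takes the fork speed to be fragment length divided by that interval, and assumes the helicase keeps that speed throughout a 1-second priming event. The function names are hypothetical.

```python
def fork_speed(fragment_len_nt, interval_s):
    """Average fork speed (nucleotides per second) implied by one
    Okazaki fragment of the given length being made per interval."""
    return fragment_len_nt / interval_s


def priming_loop_size(speed_nt_per_s, priming_time_s):
    """Single-stranded DNA (nucleotides) looped out if the helicase keeps
    unwinding at the given speed while the primase stays bound."""
    return speed_nt_per_s * priming_time_s


# Assumed inputs: 2,000-base fragments, one every 2 or 4 seconds,
# and about 1 second to make a primer (the figure given in the text).
for interval in (2.0, 4.0):
    speed = fork_speed(2000, interval)
    loop = priming_loop_size(speed, 1.0)
    print(f"interval {interval:.0f} s: fork speed {speed:.0f} nt/s, "
          f"priming loop about {loop:.0f} nt")
```

Under these assumptions the loop would hold several hundred to a thousand bases of single-stranded DNA, which is small next to a full Okazaki fragment but large enough to matter for how the lagging-strand template is organized.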

The new reports use elegant single-molecule
experiments to provide the first direct
experimental evidence for priming-loop formation
by the bacteriophage T7 and T4 replisomes.
Pandey et al. worked with the whole
T7 replisome, which has an unusual structure 
in that its primase and helicase are part of the 
same protein, so primase release is impossible. 
The authors used short DNA templates that 
were already primed on the leading strand, 
with priming sites (DNA sequences required 
for primer synthesis) on the lagging strand. 
Although lagging-strand primer synthesis 
occurred about 50% of the time, synthesis of 
the leading strand showed no sign of pausing 
while a primer was made. Next, the authors
employed a technique called fluorescence 
Figure 2 | Three priming mechanisms. Interaction of the primase with the helicase is necessary
for primer synthesis at a lagging-strand priming site. The primase makes primers in the opposite
direction to helicase movement, leaving three ways by which the replisome might resolve this
directional problem. a, The whole replisome can pause for primer synthesis; b, it can promptly release
the primase; or c, as described for the first time by Pandey et al. and Manosas et al., the replisome can
continue to move forward while the primase–helicase interaction persists. This produces a priming
loop that eventually collapses into the lagging-strand trombone loop, probably on transfer of the
primer to the lagging-strand polymerase.


resonance energy transfer (FRET), which uses 
the interaction between fluorescent dyes as a 
readout of the proximity of molecules to each 
other. The dyes were arranged on the lagging- 
strand template so that they would come close 
enough together for FRET to occur if a prim- 
ing loop were formed. FRET was observed only 
under conditions where, and about as often 
as, primers were made. The FRET data can
be explained only by the formation of a prim- 
ing loop on the lagging-strand template while 
leading-strand synthesis continues (Fig. 2c). 
In another single-molecule study, Manosas 
et al. studied the T4 replisome, in which the
primase and helicase are separate proteins that 
interact during primer synthesis. They used 
an ingenious experimental design consisting 
of a double-stranded DNA hairpin structure 
that contains priming sites when in a single- 
stranded form. The DNA is attached to a mag- 
netic bead that is stretched at a constant low 
force by a magnetic field. Videomicroscopy of 
the bead movement allows measurement of the 
length of the DNA. As the helicase converts 
the hairpin to single-stranded DNA, the DNA 
lengthens and then subsequently contracts as 
the hairpin reanneals behind it. The changes in 
DNA length allow measurement of the rate of 
helicase action in real time. Using this system, 
the authors showed that helicase–primase
interaction and subsequent primer synthesis 
did not result in helicase pausing. Most of the 
time, reannealing of the hairpin was blocked 
by the persistence of a primase-bound primer, 
indicating that the primase had been released 
promptly by the helicase at the priming site 
(Fig. 2b). Less frequently, the rate of DNA 
lengthening decreased for about half a sec- 
ond, and then there was an immediate jump in 
length. This observation can be explained only 
by the formation and subsequent release of a 
priming loop (Fig. 2c). When the helicase and
primase were artificially fused together as in
the T7 replisome, priming-loop formation was 
markedly increased, and blocks to reannealing 
(by released primase-bound primer) were not 
observed. 
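
The FRET readout described above works because transfer efficiency falls off very steeply with the distance between the two dyes. As an illustration, the standard Förster relation, E = 1 / (1 + (r/R0)^6), can be evaluated for a few separations. The Förster radius of 5 nm and the separations used here are assumed round numbers chosen for illustration, not values taken from either study.

```python
def fret_efficiency(r_nm, r0_nm=5.0):
    """Förster transfer efficiency for dyes separated by r_nm.

    r0_nm is the Förster radius (the separation at which efficiency
    is 50%); the 5 nm default is an assumed, illustrative value.
    """
    return 1.0 / (1.0 + (r_nm / r0_nm) ** 6)


# Dyes far apart on an extended template versus brought together by a loop.
for r in (2.5, 5.0, 10.0):
    print(f"separation {r:4.1f} nm -> efficiency {fret_efficiency(r):.3f}")
```

The sixth-power dependence means that halving or doubling the separation around R0 switches the signal almost fully on or off, which is why loop formation that brings the dyes together shows up as a clear FRET event.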

An unusual aspect of Pandey and colleagues’
work is the high efficiency of priming achieved
by the T7 primase on their short templates. 
Priming sites are trinucleotides that occur 
frequently in single-stranded DNA templates. 
They are generally used inefficiently by the 
primase for primer synthesis, and it is thought 
that only a fraction of primers are functionally 
extended by the lagging-strand polymerase. 
These factors account for the relatively long 
(1–2 kilobases) Okazaki fragments. When
studying lagging-strand priming during leading-strand
synthesis by the T7 replisome on
long templates, the van Oijen group clearly
observed pauses coincident with primer synthesis.
These occurred at relatively low frequency,
consistent with the size of Okazaki fragments,
but the authors’ single-molecule experimental
set-up could not detect priming loops.
Reconciliation of these observations with
those of Pandey et al. is not straightforward,
and may indicate that replisome pausing occurs 
during or soon after functional primer syn- 
thesis, while mechanisms involving primase 
dissociation and priming-loop formation 
ensure that the replisome is not unnecessarily 
slowed during more frequent, non-productive 
priming events.
Nicholas E. Dixon is at the School of Chemistry,
University of Wollongong, Wollongong,
New South Wales 2522, Australia.
e-mail: nickd@uow.edu.au
50 YEARS AGO

A Survey of Soils in the Kongwa
and Nachingwea Districts of
Tanganyika. By B. Anderson.
Everyone knows the dismal
sequel to the ambitious scheme
for the mechanized production
of ground-nuts in East Africa,
which was characterized by the
failure to employ pedological
methods on pedological
problems. After the horse had
departed, some effort was
made to shut the stable door
and a very competent soil
surveyor was set to work to
make a proper study of the
soils of the Kongwa and
Nachingwea districts. The
publication under review
presents his results and shows
what can be achieved by one
trained pedologist working
‘on the cheap’ with limited
facilities, but with specialized
technical assistance from
various institutions. The moral
for would-be planners of
land-use is obvious.

From Nature 19 December 1959.


100 YEARS AGO

It may be of interest to record
a fact which has come under
my notice while engaged in
the development of a uranious
mine in Turkestan. The ore is
oxidised and calcareous, and
contains uranium, vanadium,
and copper, radium being
present in accordance with
Prof. Rutherford’s formula,
which gives the quantity of it
in relation to the uranium. The
uranium is on the average 3.8
per cent., but in some places
reaches the ratio of 30 per cent.
and more... As I know from the
literature of the subject that
vanadium and uranium are
toxic substances, I instruct the
workmen to wash their hands
well before going to their dinner
and after their work. “We do
this,” they say, “but at the same
time we know that in actual
practice a cut on a hand,
which lasts for a long time
in a coal mine, here, when
powdered by the ore, gets
well very quickly.”

From Nature 16 December 1909.
GLOBAL CHANGE


Interglacial and future sea level 


Peter U. Clark and Peter Huybers 


A merger of data and modelling using a probabilistic approach indicates 
that sea level was much higher during the last interglacial than it is now, 
providing telling clues about future ice-sheet responses to warming. 


Predicting sea-level rise in a warming world is 
one of science’s great challenges. According to 
sea-rise projections for the twenty-first century, 
the 145 million people living within a metre of 
the present sea level risk losing their land and 
their homes. Many more would be affected 
by the resulting socio-economic disruption.
Our poor understanding of ice-sheet dynamics
means that projecting sea-level rise beyond
the twenty-first century is much less certain.
On page 863 of this issue, however, Kopp et al.
derive a new assessment of sea level during 
the last interglacial, around 125,000 years ago, 
that provides insight into this question. If their 
results are correct, the sea-level rise over the 
coming century will be followed by many more 
metres of rise over the ensuing centuries. 

Increases in global sea level stem from both 
expansion of warming water (thermosteric 
change) and addition of new water from melt- 
ing ice on land (eustatic change). Predictions 
of future thermosteric changes are relatively 
well constrained compared with those of the 
eustatic change associated with melting of the 
Greenland and Antarctic ice sheets. There is
thus a need to better determine both 
how much and how rapidly eustatic 
sea level will rise in response to a given 
forcing effect such as anthropogenic 
global warming. 

Evidence that sea level during 
the last interglacial was 4–6 metres
higher than at present has long been
proposed as a possible analogue for
the equilibrium sea-level response to
future anthropogenic warming. But
the sea-level records may include a
local response to geophysical adjustments
from the preceding glaciation,
and thus may not accurately record
the global sea level. Furthermore,
the implications of 4 or 6 m of rise are
quite different: if sea level increases by
only 4 m, much of it can be reconciled
as being due to thermosteric rise and 
partial loss of the Greenland ice sheet; 
anything more requires a contribution 
from Antarctica. 

Kopp et al. reach the startling conclusion
that, during the last interglacial,
global sea level was at least 6.6 m above 
present, and may have reached 9.4 m, 
much higher than previous estimates. 
The implication is that both the Green- 
land and Antarctic ice sheets were 
much smaller 125,000 years ago. 
To derive this result, Kopp et al. compiled a
database of proxy measurements of sea level 
that includes isotopic and coral records, as 
well as other records that are less well dated. 
Although this database is more comprehensive 
than those used in previous studies, constrain- 
ing estimates for past global sea level from noisy 
and sparse data whose timing is uncertain is a 
formidable statistical problem. It is particu- 
larly difficult because one must also account 
for regionally varying geophysical effects, 
including local tectonic uplift or subsidence, 
and sea-level changes induced by gravitational, 
deformational and rotational effects associated 
with the redistribution of ice, ocean and mass 
of the solid Earth. Using a physical model that
includes these effects, Kopp et al. derived an 
estimate of the covariance between local and 
global sea level. They then merged the local- 
global covariance estimate with proxy estimates 
of sea level within a Bayesian framework to 
make temporally complete estimates of global 
sea level and assess their probability. 
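
The idea of merging a local–global covariance with a noisy local observation can be illustrated with the simplest possible case: one global quantity, one local record, and Gaussian uncertainties. The sketch below is a toy conditional-Gaussian update and is not Kopp and colleagues' model, which handles many sites, dating uncertainty and time dependence. Every number and name in it is an assumption chosen only to make the arithmetic easy to follow.

```python
def condition_global_on_local(mu_g, var_g, mu_l, var_l, cov_gl, obs, obs_var):
    """Update a Gaussian estimate of global sea level given one noisy
    observation of local sea level that co-varies with it.

    mu_g, var_g : prior mean and variance of global sea level (m, m^2)
    mu_l, var_l : prior mean and variance of local sea level
    cov_gl      : prior covariance between global and local sea level
    obs, obs_var: the local proxy observation and its error variance
    """
    innovation_var = var_l + obs_var          # variance of the observed value
    gain = cov_gl / innovation_var            # how much the data move the estimate
    post_mean = mu_g + gain * (obs - mu_l)
    post_var = var_g - gain * cov_gl          # uncertainty can only shrink
    return post_mean, post_var


# Illustrative (made-up) numbers: a vague prior and one local proxy at 6 m.
mean, var = condition_global_on_local(
    mu_g=0.0, var_g=16.0, mu_l=0.0, var_l=20.0,
    cov_gl=12.0, obs=6.0, obs_var=4.0,
)
print(f"posterior global sea level: {mean:.1f} m, variance {var:.1f} m^2")
```

The covariance term is what lets a single site say something about the global mean: the weaker the modelled link between local and global sea level, the less a local record shifts the global estimate.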

The redistribution of mass associated with 
individual ice-sheet melting causes distinct 
Figure 1 | Similarity of latitudinal warming (ΔT) during the last
interglacial and a projection for the late twenty-first century. The
green line summarizes proxy-data estimates of sea surface and
air temperature during the last interglacial relative to the present
interglacial before industrialization. Diamonds are largely sea surface
temperatures, but include temperatures derived from polar ice cores
and two high-latitude Northern Hemisphere pollen records. The
temperatures reflect the interval between 120,000 and 130,000 years
ago (mean and 1 standard deviation). The green line is a polynomial
fit to these data. Surface air temperature estimates from less-well-dated
pollen sites in Europe (not shown) similarly show warmer
temperatures across most of Europe during the last interglacial. The
blue solid line is the zonal mean of the projected surface temperature
changes (with 1 standard deviation shown by dotted blue lines)
for the late twenty-first century relative to 1980–99; it is based on
the SRES B1 greenhouse-gas-emission scenario obtained using the
GFDL climate model. (Palaeoclimate data are available at
www.ncdc.noaa.gov/paleo/pubs/clark2009.)
spatial patterns in sea level. In conjunction
with the proxy measurements, Kopp et al. also
used the modelled patterns to estimate that 
Greenland and Antarctica each contributed 
at least 2.5 m of sea-level rise. This estimate is 
consistent with independent constraints: the 
maximum Greenland contribution was prob- 
ably 3.4 m (ref. 10), and the thermosteric plus 
mountain-glacier and ice-cap contribution was 
probably no more than 1 m. So, if sea level was 
at least 6.6 m higher, a minimum of 2.2 m must 
have come from Antarctica. The Antarctic 
contribution would probably have come from 
the inherently unstable West Antarctic Ice 
Sheet, which locks up the equivalent of at least 
3.3 m of sea level, so that Kopp and colleagues’
result implies that most, if not all, of this ice 
sheet melted about 125,000 years ago. 
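
The budget in the paragraph above is simple subtraction, and it is worth seeing how it behaves across the full range that Kopp et al. report. The sketch below uses only the figures quoted in the text (3.4 m maximum from Greenland, 1 m maximum from thermal expansion plus glaciers and ice caps, and 3.3 m locked in the West Antarctic Ice Sheet). The function and constant names are hypothetical.

```python
GREENLAND_MAX_M = 3.4                  # probable maximum Greenland contribution
THERMOSTERIC_AND_GLACIERS_MAX_M = 1.0  # thermal expansion plus glaciers and ice caps
WEST_ANTARCTIC_EQUIVALENT_M = 3.3      # sea-level equivalent quoted for the ice sheet


def antarctic_minimum(total_rise_m):
    """Smallest Antarctic contribution consistent with a given total rise,
    once the other sources are credited with their maximum shares."""
    remainder = total_rise_m - GREENLAND_MAX_M - THERMOSTERIC_AND_GLACIERS_MAX_M
    return max(0.0, remainder)


# Lower bound (6.6 m) and upper estimate (9.4 m) from Kopp et al.
for total in (6.6, 9.4):
    need = antarctic_minimum(total)
    share = need / WEST_ANTARCTIC_EQUIVALENT_M
    print(f"{total} m total -> at least {need:.1f} m from Antarctica "
          f"({share:.0%} of the West Antarctic figure)")
```

At the 6.6 m lower bound, the Antarctic share is about two-thirds of the quoted West Antarctic figure. At 9.4 m it exceeds that figure, which is why the higher estimates imply the loss of most or all of the ice sheet.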

Perhaps of greatest socio-economic concern 
is the possible maximum rate of sea-level rise 
in a warmer world. According to Kopp et al.,
sea-level rise during the last interglacial was in
the range of 6–9 millimetres per year. By comparison,
instrumental records indicate that the
rate of global sea-level rise over the twentieth
century was about 2 mm yr⁻¹. That may have
accelerated between 1993 and 2003 to around
3 mm yr⁻¹, at least in part due to an acceleration
in mass loss from the Greenland and Antarctic
ice sheets.
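
To put these rates side by side, it helps to convert each into the time needed for one metre of rise. The sketch below does only that conversion, using the rates quoted in the paragraph above. It assumes each rate is sustained unchanged, which is a simplification rather than a projection.

```python
# Rates of global sea-level rise quoted in the text, in millimetres per year.
RATES_MM_PER_YR = {
    "last interglacial, low end": 6.0,
    "last interglacial, high end": 9.0,
    "twentieth century": 2.0,
    "1993 to 2003": 3.0,
}


def years_per_metre(rate_mm_per_yr):
    """Years needed for 1 m (1,000 mm) of rise at a constant rate."""
    return 1000.0 / rate_mm_per_yr


for label, rate in RATES_MM_PER_YR.items():
    print(f"{label:28s} {rate:3.0f} mm/yr -> "
          f"{years_per_metre(rate):4.0f} years per metre")
```

At the interglacial rates a metre of rise takes little more than a century, against roughly five centuries at the twentieth-century rate.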

Why was sea level so much higher 125,000 
years ago? One possibility is that ice sheets 
have multiple potential steady states for a 
given climate. However, the global temperature
was apparently 1.5–2 °C warmer than the
pre-anthropogenic global average of the past 
10,000 years (Fig. 1), despite there 
being essentially no difference in 
atmospheric greenhouse-gas concentrations.
Climate models have simulated
a strong Northern Hemisphere
summer warming in response to
Earth's more eccentric orbit during the
last interglacial, but almost no change
in the Southern Hemisphere. Southern
warming may then have occurred
through an oceanic teleconnection
with the north, or through changes
in the duration of the Southern Hemisphere
summer, with accompanying
feedbacks amplifying this warming. 

In any event, the latitudinal dis- 
tribution of warming seems to be 
remarkably similar to the global tem- 
perature response to carbon dioxide 
under a commonly used scenario for 
greenhouse-gas emissions (compare 
the green and blue lines in Fig. 1). 
This suggests that the climate of the 
last interglacial might, by coinci- 
dence, provide a reasonable analogue 
for establishing ice-sheet sensitivity 
to global warming. Assuming that 
Kopp and colleagues’ estimates are 
accurate, and that higher sea level 
resulted from higher temperatures, 
the disconcerting message is that the 
equilibrium response of sea level to 1.5–2 °C
of global warming could be an increase of
7–9 metres.
DNA REPAIR


A heavyweight joins the fray 


Simon J. Boulton 


Tagging of DNA-damage-associated proteins by ubiquitin is key to 
coordinating the DNA-damage response. The ubiquitin-related protein 
SUMO is revealed as a crucial regulator of ubiquitylation in DNA repair. 


Ubiquitylation — the attachment of ubiquitin 
groups to cellular proteins — was initially 
characterized by its role in promoting protein 
destruction. However, we now know that the 
consequences of ubiquitylation are diverse, 
and that it affects many cellular systems. The 
ubiquitin modification comes in many flavours 
(addition of a single ubiquitin molecule, for 
example, or of polyubiquitin chains that 
differ in the position of the linkage between 
ubiquitin molecules), and the various types of 
ubiquitylation can alter the fate of target pro- 
teins in different ways. In addition, the cell has 
ubiquitin-related modifiers, such as the SUMO 
proteins, that also alter protein fate or function 
after conjugation. One process that has been
inextricably linked to ubiquitylation is the
cellular response to DNA damage. Although
studies had suggested a link between the
DNA-damage response and the SUMO pathway,
proof that SUMOylation is important
for DNA repair had remained elusive. In this
issue, two groups, Morris et al. (page 886) and
Galanty et al. (page 935), now provide good
evidence that SUMO functions together with
ubiquitin to coordinate DNA repair.

DNA double-strand breaks (DSBs) result in 
the recruitment and activation of the protein 
kinases ATM, ATR and DNA-PK, which phosphorylate
target proteins, such as the variant
histone H2AX. The phosphorylated proteins
then promote the recruitment of other DNA-repair
proteins to DSBs, including MDC1
(mediator of the DNA-damage checkpoint),
53BP1 and the E3 ubiquitin ligases RNF8,
RNF168 and BRCA1 (ref. 6), which catalyse
ubiquitylation events at DSBs. (Conjugation
of ubiquitin or related modifiers to target pro- 
teins requires an E1 activating enzyme, an E2 
conjugating enzyme and an E3 ligase.) 


To investigate the involvement of the SUMO 
pathway in the DNA-damage response, Morris 
et al. and Galanty et al. analysed the subcellular
localization of SUMO-pathway components
in mammalian cells. Both groups report
that the E1 SUMO-activating enzyme SAE1,
the E2 SUMO-conjugating enzyme UBC9,
and the three forms of vertebrate SUMO protein,
SUMO1 and the closely related SUMO2
and SUMO3 (SUMO2/3), are recruited to 
DSBs. 

The authors used RNA interference and
fluorescence microscopy to show that the
SUMO E3 ligases PIAS1 and PIAS4 are responsible
for SUMOylation events at DSBs.
Depletion of PIAS1 impaired accumulation 
of SUMO2 and SUMO3 (but not SUMO1) at 
DSBs, whereas depletion of PIAS4 impaired 
recruitment of SUMO1 and SUMO2/3. Furthermore,
recruitment of 53BP1 to DSBs
depended on PIAS4, whereas recruitment of
BRCA1 depended on both PIAS1 and PIAS4.
Is SUMOylation necessary for DSB repair? The
answer is, emphatically, yes: cells lacking
PIAS1 or PIAS4 showed defects in DSB repair
and were also highly sensitive to DSBs caused 
by ionizing radiation. 
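
The depletion results in the paragraph above amount to a small dependency table, which is easier to check when written out explicitly. The sketch below simply encodes what the text reports for each knockdown and inverts it to ask which ligases a given factor needs. The variable and function names are hypothetical, and the table records only recruitment to DSBs as described here, not any finding beyond the text.

```python
# Factors whose accumulation at DSBs was impaired by depleting each
# SUMO E3 ligase, as reported in the text above.
IMPAIRED_BY_DEPLETION = {
    "PIAS1": {"SUMO2/3", "BRCA1"},
    "PIAS4": {"SUMO1", "SUMO2/3", "53BP1", "BRCA1"},
}


def ligases_required_for(factor):
    """Return the ligases whose depletion impaired recruitment of `factor`."""
    return sorted(
        ligase
        for ligase, impaired in IMPAIRED_BY_DEPLETION.items()
        if factor in impaired
    )


for factor in ("SUMO1", "SUMO2/3", "53BP1", "BRCA1"):
    print(f"{factor:8s} depends on {', '.join(ligases_required_for(factor))}")
```

Laid out this way, the asymmetry is plain: PIAS4 is needed for everything listed, whereas PIAS1 is needed only for SUMO2/3 and BRCA1.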

What are the targets of the SUMO pathway 
during the DNA-damage response? Prompted 
by a study showing interaction between 
UBC9 and BRCA1 in the nematode worm 
Caenorhabditis elegans, both groups independently
showed that BRCA1 is SUMOylated
during the DNA-damage response in a PIAS1-
and PIAS4-dependent manner (Fig. 1). Depletion
of PIAS1 and PIAS4 impaired recruitment
of BRCA1 to DSBs, significantly impaired
ubiquitylation at DSBs, and reduced ubiquitylation
of the histones H2A and H2AX;
the latter process has been shown to require 
Figure 1 | Ubiquitylation and SUMOylation at
DSBs. Double-strand DNA breaks (DSBs) result
in the recruitment of DNA-repair proteins,
including 53BP1 and the E3 ubiquitin ligases
RNF8, RNF168 and BRCA1. Morris et al. and
Galanty et al. observe that the SUMO-pathway
components UBC9–PIAS4 and UBC9–PIAS1
also accumulate at DSBs, where they catalyse
the SUMOylation of 53BP1 and BRCA1 (and
possibly RNF8 and RNF168). SUMOylation
stimulates BRCA1 E3 ubiquitin-ligase activity,
leading to ubiquitylation of target proteins at
DSBs, including the histone H2A and its variant
H2AX. H2A and H2AX are also substrates
for ubiquitylation by RNF8 and RNF168, as is
RAP80, a ubiquitin-binding protein that also
interacts with BRCA1. RNF8 and RNF168
catalyse the formation of lysine-63-linked
ubiquitin chains, whereas BRCA1 and its E2
conjugating enzyme catalyse the formation of
lysine-6-linked ubiquitin chains. S, SUMO; Ub,
ubiquitin. Red arrows indicate SUMOylation;
black arrows indicate ubiquitylation.


the ligase activities of RNF8, RNF168 and 
BRCA1 (ref. 7). Galanty et al. also showed that
53BP1 is SUMOylated and that this affects its 
retention at DSBs. 

RNF8 and RNF168 catalyse the formation 
of lysine-63-linked ubiquitin chains, whereas
BRCA1 catalyses formation of lysine-6-linked
ubiquitin chains (Fig. 1). Morris et al.
exploited this difference in ubiquitin-chain 
linkage to pinpoint the effects of PIAS proteins 
on BRCA1 activity. They showed that over- 
expression of BRCA1 increased ubiquityla- 
tion events in cells; these events were reduced 
following PIAS1/4 depletion. Co-localization 
of lysine-6-linked ubiquitin chains with DSBs 
was also impaired in BRCA1-, PIAS1- or 
PIAS4-depleted cells. Furthermore, muta- 
tion of the two consensus SUMO-conjugation 
sites in BRCA1 reduced SUMO1 association 
and BRCA1-dependent ubiquitylation. Thus, 
the authors propose that BRCA1 is a SUMO-regulated
ubiquitin ligase (Fig. 1).

These findings raise several immediate 
questions. Are the activities of RNF8, RNF168 
and/or other E3 ubiquitin ligases also regulated
by SUMOylation? Certainly, such a scenario is
possible for RNF8. The current studies found
that, although recruitment of RNF8 to DSBs 
was unaffected by PIAS1/4 depletion, RNF8 
could not ubiquitylate DSBs, suggesting that 
it may be inactive in the absence of PIAS1/4. 
How does SUMOylation stimulate the E3 
ubiquitin-ligase activity of BRCA1? Previous
studies have shown that DNA damage
promotes association between BRCA1 and 
its E2 conjugating enzyme to form an active 
E3 ubiquitin ligase. It is therefore tempting to 
speculate that SUMOylation induces a confor- 
mational change in BRCA1 that enhances its 
binding to an E2 conjugating enzyme. It is clear 
from the current studies that SUMOylation 
functions at multiple levels during the DNA- 
damage response and this will provide fertile 
ground for future research. The discovery 
that the SUMO pathway is important for
ubiquitylation at DSBs raises the possibility that
SUMOylation may activate other ubiquitylation
events in the cell.


Soggy origami 


Vincent H. Crespi 


Flat microstructures can be designed to spontaneously fold into 
three-dimensional shapes. Computer simulations of water droplets on 
sheets of carbon atoms now extend this concept to the nanometre scale. 


Explanations of nanometre-scale phenomena 
often require strange bedfellows of scientific 
concepts and terminology. The work reported 
by Patra et al. in Nano Letters nicely illustrates
this trend by marrying chemistry, fluid mechanics,
mechanical engineering and physics.
The authors have used molecular dynamics 
simulations to show that the catalytic action 
of nanodroplets of fluids can cause a simple 
object — an atomically thin layer of carbon 
atoms, known as a graphene sheet — to fold 
spontaneously into complex shapes. The rami- 
fications of this scientific poly- 
gamy extend beyond the four 
fields mentioned above: such 
spontaneous folding evokes the 
behaviour of proteins, and graph- 
ene sheets also hold promise 
for electronic applications. 
Graphene sheets are single 
layers of graphite (the familiar 
stuff in pencils), in which the 
carbon atoms are arranged in a
honeycomb pattern. When Patra
et al. simulated a small droplet 
of water sitting on such a sheet, 
they found that, rather than 
simply sitting still, the droplet 
actively deforms the graphene 
membrane. Because atoms at 
exposed surfaces in materials 
are less stable than those bur- 
ied deeper within, the authors’ 
droplet–membrane system collectively
deforms to minimize the number of exposed 
atoms and hence lower the system's energy. The 
precise deformation depends on the shape of 
the graphene sheet and the diameter of the 
droplet, but, in general, the sheet wraps up 
around the droplet to form folds, scrolls or 
troughs (Fig. 1a). When a droplet interacts 
with a sheet shaped like an open, four-petalled 
flower, the petals fold up around the droplet to 
form a closed bulb (Fig. 1b). 

In principle, a graphene membrane without 


Figure 1 | Folding sheets. Patra and colleagues’ molecular dynamics
simulations reveal that droplets of water cause atomically thin layers of
graphite (graphene sheets) to fold up into more compact shapes. a, This
simulation shows that a nanodroplet of water molecules (red) causes the free
end of a graphene ribbon 30 × 2 nanometres in size to wrap around it. The two
opposing sheets then slide along each other, folding the ribbon in half. b, Here,
a droplet of water causes the ‘petals’ of a flower-shaped graphene ribbon about
13 nanometres across to fold up around it. (Images taken from ref. 1.)
a droplet could also lower its energy by bending,
so that some of the surface carbon atoms
become buried in the interior of a more com- 
pact shape. But the transition of an exposed, 
flat membrane into a smaller, folded package 
is problematic for an isolated sheet: the energy 
cost of forming intermediate, partially curled 
shapes is not compensated for by the short- 
range attraction between the approaching 
surfaces until the sheet has bent enough for 
the surfaces to touch. 

This is where the nanodroplets come in. 
Patra and colleagues’ study shows that fluid
droplets act as catalysts for graphene defor- 
mation — they remove the energy barrier that 
prevents folding reactions, without themselves 
undergoing any structural changes. Remark- 
ably, after a droplet has done the work of 
folding the sheet, it can be expelled from the 
resulting structure as the opposing graph- 
ene surfaces press tightly against each other. 
Apparently, graphene surfaces prefer being wet 
to being naked, but they prefer the contact of 
other graphene surfaces even more. 
As a result, fluid nanodroplets can
cleanly convert flat graphene sheets 
into folded bilayers, then leave 
gracefully after their work is done. 
Such bilayered graphene systems 
are interesting in their own right as 
they profoundly alter the remarkable 
electronic properties of graphene.

Spontaneous curling or folding is 
not unique to soggy graphene. It is 
also seen in a variety of other sys- 
tems, ranging from micromachines 
to biomolecules. The spontaneous 
deformation of flat sheets was a 
nuisance in early work on micro- 
machines that were etched from 
silicon or related materials. When 
thin sheets of material were released 
from an underlying substrate, they 
would curl up in an uncontrolled 
fashion to release previously hidden internal
stresses in the material, thus destroying the 
carefully planned geometry of the desired 
machine. 

The unwanted deformations of micro- 
machines were subsequently brought under 
control, and even turned to good use, by delib- 
erately engineering stresses into materials to 
generate a preferred direction for bending. 
This can be achieved by considering the atomic 
structures of the materials. Every crystalline 
solid has its own preferred spacing between 
constituent atoms — the atoms in germanium, 
for example, are more widely spaced than those 
in silicon. If two sheets of different crystalline 
materials are layered to form a thin bilayer 
sheet in which the atoms across the interface 
are mutually aligned, then the layer that prefers 
a larger inter-atomic spacing is placed under 
compression, whereas the other layer is placed 
under tension. These internal strains can be 
relaxed if the sheet spontaneously curls away 
from the compressed side. Such systems can be 
designed to bend or curl into desired shapes, 
such as scrolls, spirals and even pop-up structures.
By contrast, there is no way to build
such stresses into a single layer of graphene.
Patra and colleagues’ findings circumvent this
problem: the interactions of the graphene sheet 
with the liquid drop define the way that the 
sheet will curl. 

The interactions of fluids with materials have 
also been exploited at the micrometre scale to 
create spontaneously folding structures. In 
these cases, the size of the structures allows 
the use of photolithography — a finely honed 
technique best known for sculpting integrated 
circuits out of silicon — to precisely define the 
geometry of initially flat shapes, which sub- 
sequently fold when regions of solder within 
them are melted. The natural tendency of the 
solder droplets is to minimize their exposed 
surface area; in doing so, they induce the struc- 
tures to pull themselves into more compact, 
three-dimensional objects such as cubes or 
tetrahedra. Patra and colleagues’ simulations
potentially extend this ingenious technique to 
objects a hundred-fold smaller. 

The three-dimensional structures of poly- 
meric biomolecules, such as proteins and 
DNA, are formed by similar, exquisitely precise 
folding of one-dimensional chains. Humans 
have learned to exploit this phenomenon, 
particularly in the practice of DNA origami, 
wherein specific interactions between com- 
plementary DNA strands are programmed 
to interweave a backbone helix into a desired 
shape through the incorporation of so-called 
staple strands. The folding of proteins, by
contrast, is governed at the crudest level by the 
tendency of hydrophobic (water-repellent) 
regions to curl up within the interior of folded 
protein structures. In this way, hydrophobic 
protein domains become shielded from their 
watery environment by hydrophilic (water- 
attracting) regions of the same protein strand. 
Could hydrophobic or hydrophilic groups be
engineered into graphene to modulate its folding,
so extending the one-dimensional lessons
of biology to the two-dimensional world of
graphene? The jury is still out, but Patra and
colleagues’ preliminary work certainly opens
up investigations of this idea. 

The quantitative details of Patra and co-workers’
empirical simulations, particularly those
concerning subtle sheet–fluid and sheet–sheet
interfacial interactions, merit verification
by more precise methods. For many practical 
applications, graphene would lie on a substrate, 
and so it would also be useful to incorporate 
sheet-substrate interactions into a second gen- 
eration of simulations. But the fundamental 
physics described by Patra and colleagues’ mod- 
els is undoubtedly correct. Experimental valida- 
tion of their findings is the next obvious step. 
The availability of a wide assortment of fluids
suggests that the physical balance of fluid–sheet
and fluid–fluid interactions required to bring
about graphene origami should be possible in
the real world.
NEUROSCIENCE


New tricks and old spines 


Noam E. Ziv and Ehud Ahissar 


Imaging of brain structures in living mice reveals that learning new tasks 
leads to persistent remodelling of synaptic structures, with each new skill 
associated with a small and unique assembly of new synapses. 


The notion that structural changes in brain 
circuitry underlie certain forms of learning is 
widely accepted, yet this belief has been frus- 
tratingly difficult to establish experimentally. 
Two studies, one by Xu et al.¹ (page 915) and
one by Yang et al.² (page 920) published in
this issue, provide compelling evidence that 
learning new motor tasks (and acquiring new 
sensory experiences) is associated with the 
formation of new sets of persistent synaptic 
connections in motor (and sensory) regions 
of the mouse brain. These findings suggest that 
synapse assemblies, rather than cell assemblies, 
might be viewed as the elementary entities 
(engrams) of stored memories. 

At a basic level, the brain can be viewed as
a vast network of neurons connected to each 
other by specialized structures known as syn- 
apses. Most synaptic connections are formed 
between axons (the slender and elongated 
extensions that carry the signals generated 
by neurons) and dendrites (highly branched 
extensions that are specialized for receiving 
signals originating in other neurons). Most 
axodendritic synapses are found on dendritic 
spines, which are tiny protrusions that extend 
from dendritic shafts in large numbers (Fig. 1a, 
overleaf). 

It has long been assumed that structural 
changes in this complex circuitry provide the 
basis for long-term memory formation or the 
learning of new tasks*. The development of 
new imaging tools’ over the past decade has 
opened the door to an experimental evaluation 
of this assumption as it has allowed for
longitudinal studies of axonal and dendritic 
morphology in the brain of living animals 
(mainly mice). Somewhat surprisingly, it has 
become evident that overall neuronal mor- 
phology is remarkably stable over long periods 
(months and beyond). What such studies have 
revealed, however, is that structural changes 
do occur at the level of individual synapses, 
manifested by the appearance of new dendritic 
spines and the disappearance of others over the 
course of hours and days*”®. Interestingly, the 
extent of spine remodelling can be altered by 
experimental manipulations, such as depriv- 
ing the animals of sensory input (from the 
whiskers or eyes)*”®. Yet a direct relationship 
between spine remodelling and learning had 
yet to be demonstrated. 

The two studies in this issue¹,² provide
strong evidence for this link. In both studies, 
mice were trained to perform a new motor task 
(reaching for a single seed¹ or remaining on
an accelerating rotating rod²), and two-photon
microscopy’ of the living animals was used to 
investigate whether successful training was 
associated with changes in spine remodelling 
beyond those observed in untrained mice. 
Both studies revealed that by the end of the first 
1-2 days of training (and as soon as one hour 
after training’), twice as many new spines had 
appeared in the brain of trained mice compared 
with untrained mice. Continuous training 
was subsequently followed by increased rates 
of spine elimination, and so, after 1-2 weeks, 
Figure 1 | Structural remodelling associated with
motor task learning¹,². a, Several axons can form
en passant synapses (synapses formed along
axons rather than at their tips) with dendritic
spines extended from a nearby dendrite. Synaptic
vesicles contain neurotransmitters secreted
at these synapses. b, The acquisition of new
motor task A is associated with the formation
of new spines. This can occur by the selective
stabilization of a dendritic filopodium that had
contacted a nearby axon beforehand (spine 1)
or by the direct emergence of a new spine
that extends towards a nearby axon (spine 2).
Motor task acquisition is then followed by the
elimination of other spines (3). c, Acquisition
of a second motor task (task B) is followed by
the formation of an additional set of new spines
(4) and the elimination of others (5), without
significantly affecting the spines formed during
the acquisition of task A.


total spine numbers did not differ between 
trained and untrained animals. Remarkably, 
behavioural performance correlated well with 
the numbers of new spines formed shortly after 
training (and with the extent of pre-existing- 
spine elimination) when these parameters were 
compared across all trained animals. 

Yang and colleagues’ also examined how 
exposure to an enriched environment (alter- 
ing patterns of bead strings hanging from cage 
tops) affected spine remodelling. In essence, 
spine remodelling was affected in the same
manner. Here, however, remodelling was 
confined to particular regions of the cerebral 
cortex concerned with sensory input from the 
whiskers, unlike spine remodelling associated 
with motor task learning, which was confined 
to specific cerebral cortex regions concerned 
with forelimb movement’”. 

Although these findings strongly support 
the notion that some forms of learning are 
‘encoded’ by changes in brain circuitry, several 
points warrant further discussion. In agree- 
ment with other studies***, most (96-98%) of 
the new spines in both trained and untrained 
animals were short-lived (days), with <1% 
persisting for many months’. Therefore, only 
a tiny fraction of the new spines formed over a 
2-day period would be expected to survive for 
long durations. Yang and colleagues’ calculated 
that at the end of the mouse’s life (assumed to 
be 36 months) these new spines would make up 
0.04% of the total number of spines in the par- 
ticular cortical areas examined. Although this 
tiny fraction is suggested to be a sufficiently 
large number of spines to encode a learned 
behaviour, one wonders how baseline changes 
in spine number, shown in the same studies’” 
to be about 3-5% per day (two orders of mag- 
nitude greater), can occur without some loss of 
coherent brain function. 
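The 'two orders of magnitude' comparison can be checked with a line or two of arithmetic. The sketch below uses only the percentages quoted above (0.04% for the persistent task-related spines, 3-5% per day for baseline turnover); the variable names are illustrative and nothing here is taken from the original studies' analysis.

```python
import math

# Figures quoted in the text (both expressed as a percentage of total spines).
persistent_percent = 0.04            # task-related spines surviving long term
baseline_percent_per_day = (3.0, 5.0)  # day-to-day change in spine number

# How many times larger is daily baseline turnover than the persistent set?
ratios = [b / persistent_percent for b in baseline_percent_per_day]
orders_of_magnitude = [math.log10(r) for r in ratios]

for b, r, o in zip(baseline_percent_per_day, ratios, orders_of_magnitude):
    print(f"{b:.0f}% per day is {r:.0f} times 0.04% (10^{o:.2f})")
```

The ratio comes out at roughly 75 to 125, that is, close to two orders of magnitude.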

One possible explanation might be that most 
short-lived spines are only weakly functional”. 
It is intriguing to speculate that this large pool 
of weak synapses could serve as a substrate for 
selective processes that would promote the 
stabilization and maturation of a minority of 
such synapses according to certain functional- 
ity criteria. These synapses would then survive 
and eventually underlie future behaviour. In 
this respect it is worth mentioning that training 
was not associated with the additional prolifer- 
ation of dendritic filopodia’” (often considered 
to represent spine precursors’) but was asso- 
ciated with increased conversion of filopodia 
to spines (Fig. 1b, spine 1) and the subsequent 
stabilization of the converted structures’, which 
would be congruent with such selective*”"® 
processes. As compelling as this possibility is, 
however, the data are also consistent with the 
possibility that new spines formed through 
instructive processes, for example, by spine 
extension at particular locations in response 
to specific cues (Fig. 1b, spine 2). 

Of particular interest is the finding that 
training-associated spine remodelling was not 
observed in mice that failed to learn the reach- 
ing task or could not reach the seeds', or in 
animals exposed to a slowly rotating rod task” 
(which does not require learning). Whereas the 
latter case can be explained by the failure of new 
spines to satisfy a certain short-term function- 
ality criterion (such as causally linked pre- and 
postsynaptic activities’), the former can only 
be explained by a behaviourally driven spine 
formation and stabilization criterion, operat- 
ing at longer timescales (hours and days, rather 
than minutes). The most likely criterion would 
be a reward-related one’” — spines would be
stabilized only if their functionality resulted in 
rewarded actions. 

This strongly implicates the involvement 
of diffuse modulatory brain systems — small 
groups of neurons located in specific brain 
regions (mainly the brainstem), whose projec- 
tions cover large brain regions"’. Most notable 
in this respect is the dopaminergic system, 
whose activation has been shown time and 
again to have key roles in various forms of rein- 
forcement, or reward-driven learning. Indeed, 
it was recently reported” that the elimination 
of dopaminergic terminals within the same cer- 
ebral cortex regions concerned with forelimb 
movement specifically impaired the acquisition 
of a motor skill (reaching for a food pellet, a task
similar to that studied by Xu and colleagues’) 
but not the execution of a previously acquired 
skill. Novelty, another attribute ascribed to 
these modulatory brain systems, also seemed 
to be extremely important, as re-exposure of 
trained animals to the original training tasks 
did not lead to particular spine remodelling, 
whereas learning new motor tasks did’”. 

Given the sparseness of the task-related 
synapse groups”, it is not surprising that 
Xu et al. found that the overlap between syn- 
apse groups related to two different tasks was 
negligible (Fig. 1c). This finding, the fact that 
most task-related remodelling involved new 
spines, and the lack of clear size changes in 
pre-existing spines following training’, would 
seem to be at odds with certain tenets of the 
cell-assembly hypothesis of memory'"”. 
The cell-assembly hypothesis assumes that what 
distinguishes one memory trace from another 
is the composition of cells that are co-activated 
within a given network. Most notably, this 
hypothesis posits that these distinct composi- 
tions are created during memory formation by 
retuning the strengths of all the synapses already 
in the network, along a continuum of synaptic 
strengths. The new results, however, point to 
strong associations between learned tasks and 
specific, non-overlapping groups of novel syn- 
apses whose strength might remain relatively 
constant once formed. These ‘synapse assem- 
blies’ might thus be viewed as the fundamental 
engrams for these specific motor tasks. Synapse 
assemblies in a given network would hold the 
keys to the functioning of that network; for every 
learned task, the network would function with a 
connectivity scheme that had been formed and 
optimized specifically for that task. 

Clearly, further studies are required to deter- 
mine whether the experience-associated spine 
remodelling described here can be generalized 
to other forms of memory and other brain sys- 
tems. Investigating the involvement of diffuse 
modulatory systems in the process is another 
intriguing challenge. And although it remains to 
be shown conclusively that these forms of spine 
remodelling are essential components of long- 
term learning and not merely distant echoes 
of other, yet to be discovered processes, these 
exciting studies make a convincing case for a 
structural basis to skill learning and reopen the
field for new theories of memory formation.
MICROSCOPY


Photons and electrons team up 


F. Javier Garcia de Abajo 


An imaging technique has been demonstrated that blends the principles 
of conventional light and electron microscopy. It renders images with 
nanometre and femtosecond space-time resolution. 


How can we probe light fields in the vicinity 
of nanostructures? This question is becoming 
increasingly relevant as the confinement and 
steering of light at the nanoscale are gather- 
ing momentum in applications such as opti- 
cal sensing and information processing. On 
page 902 of this issue, Barwick et al.' provide 
a practical answer to this question. They report 
a microscopy technique that probes such fields 
by focusing both light and electron pulses on 
the nanostructures under study. 

In the new imaging technique, which Bar- 
wick and colleagues baptized photon-induced 
near-field electron microscopy (PINEM), a 
specially designed electron microscope is used 
to project magnified images of nanostructures 
in much the same way as an overhead projector 
forms images of slides from a light beam pass- 
ing through them. In PINEM, the beam is made 
of electrons and the role of the slides in the pro- 
jector is played by light trapped in the vicinity of 
the samples (see Fig. 4 on page 905). 

The term ‘trapped’ means that the light waves 
do not propagate and decay exponentially in 
intensity with distance from the sample. Unlike 
freely propagating light, this trapped light, 
called an evanescent light field, can interact 
efficiently with the electrons, which gain or 
lose energy through the absorption or emis- 
sion of light quanta (photons). By means of 
energy-filtering, the microscope subsequently 
selects and collects only those electrons that 
have undergone energy gain, forming images 
in a process that retains the sub-nanometre 
spatial resolution that is characteristic of con- 
ventional transmission electron microscopes. 
The number of collected electrons is propor- 
tional to the strength of the evanescent field. 

Travelling at 70% of the speed of light, the 
electrons used by Barwick et al.’ spend only 
a fraction of a femtosecond (1 femtosecond 
is 10⁻¹⁵ seconds) near their 100-nanometre-thick
samples (carbon nanotubes or silver
nanowires). To observe a sizeable, and useful, 
electron-light interaction during such a short 
time interval requires intense light fields. In 
their experiment, the authors achieved high- 
intensity fields by using two synchronized 
femtosecond light pulses: one of the pulses 
was directed to the microscope’s electron gun, 
which converted it into an electron pulse via 
photoemission; the other, with a peak inten- 
sity of about 10-100 gigawatts per square 
centimetre, was aimed at the nanostructure 
and was capable of producing multiple-pho- 
ton absorption (or emission) events by each 
passing electron — up to eight photons, as the 
authors report’. By design, both pulses have a 
similar temporal duration, of the order of a few 
hundred femtoseconds (this duration defines 
the time resolution of PINEM), and their rela- 
tive delay in the time of arrival at the sample 
was controlled with femtosecond precision 
through the difference in optical path length 
between the two original light pulses. 
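Two of the timing figures in this paragraph follow from simple kinematics, and a short calculation makes them concrete. The sketch below assumes a straight-line passage through a 100-nanometre sample at 70% of the speed of light, and light travelling in vacuum for the path-length delay; the function names are illustrative only and are not taken from Barwick and colleagues' work.

```python
C = 299_792_458.0  # speed of light in vacuum, metres per second


def transit_time_fs(thickness_nm: float, beta: float) -> float:
    """Time in femtoseconds for an electron moving at beta*c to cross the sample."""
    seconds = (thickness_nm * 1e-9) / (beta * C)
    return seconds * 1e15


def delay_fs(path_difference_um: float) -> float:
    """Arrival-time delay in femtoseconds produced by extra optical path (vacuum assumed)."""
    seconds = (path_difference_um * 1e-6) / C
    return seconds * 1e15


transit = transit_time_fs(100.0, 0.70)
per_micron = delay_fs(1.0)

print(f"Transit through 100 nm at 0.7c: {transit:.2f} fs")
print(f"Delay per micrometre of extra path: {per_micron:.2f} fs")
```

Under these assumptions the electron spends just under half a femtosecond in the sample, and each micrometre of path difference shifts the relative arrival time by about 3.3 femtoseconds, which is why adjusting the optical path length gives femtosecond-level control of the delay.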

Light-electron interactions similar to those 
seen in Barwick and co-workers’ experiment 
have been observed previously — for exam- 
ple, when electrons pick up thermal phonons 
(quanta of atomic lattice vibrations) in insu- 
lator films’ or plasmons (quanta of collective 
oscillations of conduction electrons) from a 
metal surface’, or when electrons interact with 
evanescent fields reflected from an illuminated 
diffraction grating’. Barwick et al. are the first 
to exploit such interactions to image an evanes- 
cent light field in the vicinity of a nanostructure,
and they accomplish that with nanometre and 
femtosecond space-time resolution (see Fig. 2 
on page 903). 

Evanescent light fields have been probed 
previously with near-field optical microscopes” 
that rely on scanning a subwavelength-sized 
tip over the sample to form images. These 
microscopes can yield femtosecond time
resolution but are limited by the size of the 
tip to tens of nanometres in spatial resolution. 
In addition, the tip can produce undesired 
artefacts in the images. By contrast, Barwick 
and colleagues’ PINEM technique achieves the 
sub-nanometre spatial resolution that electron 
microscopes do. What’s more, the electrons 
constitute a relatively ‘clean’ probe: moderate
electron-beam intensities cause only marginal 
perturbations in the sample, thus allowing 
faithful imaging. 

Barwick et al. demonstrated their PINEM 
technique for light pulses lasting about 
220 femtoseconds, and observed the real-time 
evolution of the evanescent field that mimics 
the light pulses themselves. With 220-femto- 
second pulses, one could also investigate the 
dynamics of optical excitations in the sample 
that have comparable or larger lifetimes, such 
as certain long-lived photon states that occur in 
insulating structures (for example, ‘Mie modes’ 
in silicon cavities). However, impressive as it is, 
the technique needs to be adapted for shorter 
light pulses that can follow the ultrafast dynam- 
ical optical response of many nanostructures of 
interest, such as metallic nanoparticles whose 
plasmons typically live for only a few tens of 
femtoseconds. 

The propagation of light fields along the 
surface of a nanostructure is a key ingredient 
of nanophotonic devices, which carry and pro- 
cess optical signals®. The PINEM technique 
could be improved to study such propagation 
by sampling the evanescent decay of such fields 
along the direction perpendicular to the sam- 
ple surface. This and other developments will 
surely show that, with Barwick and colleagues’ 
electrons gaining energy, the scientific com- 
munity will also gain new means of observing 
the nanoworld.
F. Javier Garcia de Abajo is at the Institute of 
Optics, CSIC, Serrano 121, 28006 Madrid, Spain. 
e-mail: jga@cfmac.csic.es 
OBITUARY


Claude Lévi-Strauss (1908-2009) 


Leading anthropologist of his generation. 


Claude Lévi-Strauss had only the slightest 
experience of ethnographic fieldwork, and 
had no formal training in anthropology. 
Nevertheless, his ideas transformed the 
discipline, and profoundly influenced the 
other human sciences. He died on 31 October. 

The son of an artist father, and the grandson 
of a rabbi in Strasbourg, France, Lévi-Strauss
was educated in law and philosophy at the 
Sorbonne in Paris. In 1935, he joined a French 
contingent at the new University of São Paulo
in Brazil. He embraced the opportunity, 
because he had a very particular ambition. 
Jean-Jacques Rousseau once suggested that 
an expedition should be sent to the Americas 
to study human nature in its essential state, 
uncorrupted by civilization. A devotee of 
Rousseau’s philosophy, Lévi-Strauss was
determined to execute the master’s plan. 

During his university holidays, Lévi-Strauss 
made expeditions to study remote Amerindian 
settlements, but almost all of this work was 
done at telegraph posts where Indians were in 
contact with government agencies and traders. 
Only towards the end of his travels did he 
make brief contact with an isolated band living 
in the old style, but naturally enough they 
spoke no Portuguese. “Alas! They were only
too savage,” Lévi-Strauss reported. “They were
as close to me as a reflection in a mirror;
I could touch them, but I could not
understand them.”

This adventure might have been no more 
than an interlude. But shortly after Lévi-Strauss 
returned to France, the German army invaded 
the country and he became an exile in New 
York. At a loose end, he wrote up his Brazilian
field notes, became an informal member 
of the anthropology circle at Columbia 
University and spent long days combing the 
anthropology shelves in the New York Public 
Library. “What I know of anthropology I
learned during those years,” he later remarked.

Although his American colleagues were 
steeped in regional ethnography, they 
were notoriously suspicious of theoretical 
abstractions. Lévi-Strauss, however, was 
determined to use observations of hunter- 
gatherers as the basis for a theory of human 
nature, like a more empirical Rousseau. Such 
a soaring ambition required a new theoretical 
framework. Inspiration came from a fellow 
exile in New York, the Russian linguist Roman 
Jakobson. Following the path-breaking 
contributions of Ferdinand de Saussure, 
linguistics was in the throes of a theoretical
revolution. Lévi-Strauss concluded that it 
would show the way to a new, generalizing, 
structural anthropology. 

Jakobson was particularly interested in 
phonemics, the branch of linguistics that deals
with the communication of meaning through 
sounds. He claimed to have split the atom of 
linguistics, the phoneme. The phoneme had 
been viewed as the smallest significant unit of 
sound in speech, but according to Jakobson
it was itself a bundle of features made up of
pairs of contrasting elements. So, for instance,
English speakers invest the contrasting b and p
sounds with meaning (the words ‘bill’ and ‘pill’
are obviously different to our ears), whereas
in other languages the distinction may be
unmarked and unheard. Lévi-Strauss
argued that systems of classification are
constructed on a similar pattern of binary
oppositions.

Returning to Paris in 1949, Lévi-Strauss 
found employment at the Museum of Man 
and then the École Pratique des Hautes Études
at the Sorbonne. His massive doctoral thesis, 
published that year, argued that the imposition 
of a taboo on incest marked the break between
a natural and a cultural order, obliging men 
to exchange sisters with other men, and so 
creating family and kinship networks. In 
the simplest (and implicitly oldest) systems, 
these networks are structured by a binary 
classification of relatives into two classes: 
unmarriageable kin and marriageable affines. 

In 1959, Lévi-Strauss was appointed to a
chair at the College of France in Paris. He 
now began to publish his most influential 
studies, which dealt with systems of thought. 
The Savage Mind (1962) argued that Native 
Americans, and other hunter-gatherers from 
Australia to Africa, operate a “logic of the 
concrete” — they order images taken from 
the world around them in a series of binary 
oppositions. This home-made natural science 
provides metaphors for social relationships. 
In many languages, for example, the Sun
and the Moon are associated with male and
female characteristics. Paired species of
birds and animals are contrasted in terms
of colouring, feeding habits, or some other
defining features, and are then associated with
other paired objects (Australian Aboriginals,
for instance, associate the Sun with the crow
and the eaglehawk with the Moon). Such
categorizations symbolize the social contrasts
between men and women, or between pairs of
human clans or occupations.

In his masterpiece — a collection of four 
volumes on American mythology, beginning 
with The Raw and the Cooked in 1964
and culminating with The Naked Man in 
1971 — Lévi-Strauss demonstrated how 
systems of classification are put to work in 
myths that are at once epics, moral treatises 
and accounts of the world. He proposed 
that the initial premises of myths (say, that 
women are lunar, men solar) are played with 
in subsequent versions to yield new premises. 
These transformations follow implicit rules 
that allow only a sort of logical progression, in 
the form of the inversion of the initial terms, 
or a series of substitutions by which one binary 
pair replaces another. As he put it: “The kind 
of logic in mythical thought is as rigorous as 
that of modern science, and... the difference 
lies, not in the quality of the intellectual 
process, but in the nature of things to which 
it is applied.”

Lévi-Strauss notoriously claimed that he 
had a neolithic intelligence, that his thought
was intuitively sympathetic to that of hunter- 
gatherers. His grand theory rested on a binary 
opposition between nature and culture (taken, 
of course, from Rousseau), and so between 
the small-scale, technically simple societies 
that existed during the first 150,000 years 
of human history and modern civilization, 
which rendered the simpler societies obsolete. 
He believed that humanity in its natural 
condition was adapted to the environment, 
whereas civilized societies endanger the 
environment and obliterate cultural variation. 
This deeply pessimistic view was conditioned 
by the Amazonian idyll of his youth, and the 
European catastrophe of the Second World 
War that followed. But he also believed that 
people everywhere ultimately thought in the 
same way, although about different things, 
and that the clash of cultures is necessary for 
human adaptation. 

Adam Kuper 

Adam Kuper is visiting professor at the 
Department of Anthropology, Yale University, 
New Haven, Connecticut 06520, USA. 
e-mail: adam.kuper@googlemail.com 
Probabilistic assessment of sea level
during the last interglacial stage 


Robert E. Kopp, Frederik J. Simons, Jerry X. Mitrovica, Adam C. Maloof & Michael Oppenheimer


With polar temperatures ~3-5 °C warmer than today, the last interglacial stage (~125 kyr ago) serves as a partial analogue 
for 1-2 °C global warming scenarios. Geological records from several sites indicate that local sea levels during the last 
interglacial were higher than today, but because local sea levels differ from global sea level, accurately reconstructing past 
global sea level requires an integrated analysis of globally distributed data sets. Here we present an extensive compilation of 
local sea level indicators and a statistical approach for estimating global sea level, local sea levels, ice sheet volumes and their 
associated uncertainties. We find a 95% probability that global sea level peaked at least 6.6 m higher than today during the 
last interglacial; it is likely (67% probability) to have exceeded 8.0 m but is unlikely (33% probability) to have exceeded 
9.4 m. When global sea level was close to its current level (≥ −10 m), the millennial average rate of global sea level rise is
very likely to have exceeded 5.6 m kyr⁻¹ but is unlikely to have exceeded 9.2 m kyr⁻¹. Our analysis extends previous last
interglacial sea level studies by integrating literature observations within a probabilistic framework that accounts for the
physics of sea level change. The results highlight the long-term vulnerability of ice sheets to even relatively low levels of
sustained global warming.


As a result of industrial activity, greenhouse gas concentrations now 
exceed levels reached on Earth at any time within the past 800 kyr 
(ref. 1). Given a climate sensitivity of 2-4.5 °C per doubling of carbon 
dioxide levels*, current greenhouse gas concentrations—without 
considering any further increases—are sufficient to cause an equilib- 
rium warming of 1.4-3.2°C. Among the many effects expected to 
accompany this warming is a rise in global sea level (GSL)’, which is 
defined as the mean value of local sea level (LSL) taken across the 
ocean. This rise is driven primarily by thermal expansion of sea water 
and by melting land ice. Uncertainties in ice sheet behaviour make it 
difficult to predict sea level rise using prognostic models, but by the 
end of the twenty-first century, GSL could exceed today’s value by 
more than one metre (refs 3, 4). As changes of this magnitude have no 
precedent in recorded history, to understand them and to compile 
observations against which to test models of future climate change, it 
is necessary to turn to the geological record. 
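The step from climate sensitivity to equilibrium warming can be illustrated with a short calculation. The sketch below assumes the conventional logarithmic relation, in which warming equals sensitivity multiplied by the number of doublings of the carbon dioxide equivalent concentration; that relation and the function names are assumptions made for illustration and are not stated in the text above. It simply asks what concentration ratio is implied by the quoted ranges.

```python
import math


def equilibrium_warming(sensitivity_c: float, conc_ratio: float) -> float:
    """Warming (deg C) for a given sensitivity per doubling and concentration ratio."""
    return sensitivity_c * math.log2(conc_ratio)


def implied_ratio(warming_c: float, sensitivity_c: float) -> float:
    """Concentration ratio (relative to baseline) implied by a warming and a sensitivity."""
    return 2.0 ** (warming_c / sensitivity_c)


# Pair the low ends and the high ends of the two quoted ranges.
low_ratio = implied_ratio(1.4, 2.0)
high_ratio = implied_ratio(3.2, 4.5)

print(f"Implied ratio at the low end:  {low_ratio:.2f}")
print(f"Implied ratio at the high end: {high_ratio:.2f}")
```

Both ends of the range imply a similar ratio of about 1.6, or roughly 0.7 of a doubling, which suggests the quoted warming range mainly reflects the uncertainty in sensitivity.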

In this Article, we analyse a new compilation of geographically 
dispersed sea level indicators spanning the last interglacial stage 
(LIG), which climaxed about 125,000 years ago (125 kyr ago). The 
LIG (also known as the Eemian stage, its local northern European 
name, and as Marine Isotope Stage 5e) is of special interest for three 
reasons: (1) it is recent enough that it is possible to obtain some sea 
level records with high temporal resolution and many more observa- 
tions with lower temporal resolution; (2) due in large part to 
enhanced Northern Hemisphere insolation, global and polar tem- 
peratures may have been slightly warmer than at present; and (3) 
several lines of evidence suggest that GSL was higher than today, 
perhaps by 4-6 m (ref. 1), and that the Greenland Ice Sheet and
possibly also the West Antarctic Ice Sheet**’ were significantly smal- 
ler than they are now. 

During the LIG, greenhouse gas concentrations were comparable 
to pre-industrial Holocene levels’, but Earth’s orbital eccentricity was 
more than twice the modern value®. Energy balance modelling pre- 
dicts that, as a consequence, summer temperatures between 132 and 


124 kyr ago on all land masses except Antarctica were at least 0.5 °C 
warmer than today’, while a more complete climate model indicates 
summer temperatures 2-4 °C warmer than today in most of the
Arctic®. Ice core data from both Greenland and Antarctica suggest 
polar temperatures in both hemispheres of about 3-5 °C warmer 
than today’, comparable to the 3-6 °C of Arctic warming that is 
expected to accompany 1-2 °C of global warming”. In Europe, pollen 
data suggest middle Eemian summer temperatures about 2 °C warmer 
than present'!. While the change in global mean temperature is uncer- 
tain, sea surface temperatures in the equatorial Pacific’’ and Atlantic’ 
were about 2 °C warmer than pre-industrial levels. 

Synthesizing geological sea level indicators into a global reconstruc- 
tion requires accounting for regional variability. Differences between 
LSL and GSL arise because—contrary to an analogy commonly taught 
in introductory classes—adding water from melting land ice to the 
ocean is not like pouring water into a bathtub. Many factors other than 
the changing volume of water in the ocean modulate the influence of 
melting ice sheets on LSL. These factors include: the direct gravitational 
effect of the distribution of ice, water and sediment on the sea surface 
(or geoid), solid Earth deformation and its associated gravitational 
signature, perturbations to both the magnitude and orientation of 
the Earth’s rotation vector, and time-varying shoreline geometry'*"°, 
as well as changes in ocean and atmosphere dynamics’’. In addition, 
LSLs are influenced by tectonic uplift and thermal subsidence. 
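The definition of GSL given earlier, as the mean of LSL taken across the ocean, can be made concrete with a toy calculation. The sketch below assumes LSL changes sampled on an equal-angle latitude-longitude grid, so that each ocean cell is weighted by the cosine of its latitude; the grid, the values and the function names are hypothetical and are not drawn from the database described in this Article.

```python
import math


def cell_weight(lat_deg: float) -> float:
    """Relative area of an equal-angle grid cell centred at this latitude."""
    return math.cos(math.radians(lat_deg))


def global_sea_level(cells):
    """Area-weighted mean of local sea level over ocean cells only.

    cells: iterable of (latitude_deg, is_ocean, lsl_change_m) tuples.
    """
    weighted_sum = 0.0
    total_weight = 0.0
    for lat_deg, is_ocean, lsl_m in cells:
        if not is_ocean:
            continue  # land cells do not contribute to the ocean mean
        w = cell_weight(lat_deg)
        weighted_sum += w * lsl_m
        total_weight += w
    if total_weight == 0.0:
        raise ValueError("no ocean cells supplied")
    return weighted_sum / total_weight


# Hypothetical values: two ocean cells with different LSL changes, one land cell.
toy_cells = [(0.0, True, 1.2), (60.0, True, 0.4), (30.0, False, 0.0)]
gsl = global_sea_level(toy_cells)
print(f"GSL change: {gsl:.3f} m")
```

In this toy case neither ocean cell records the global value: the equatorial cell reads higher and the high-latitude cell lower than the mean, which is the distinction between LSL and GSL that the text draws.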

As a consequence of these factors, LSLs at Pacific islands far from 
the late Pleistocene ice sheets were 1-3 m higher in the middle
Holocene than today, even though GSL was essentially unchanged”. 
Similarly, even if GSL was never higher than today, LSLs several 
metres higher than present could have occurred far from the former 
Laurentide Ice Sheet (for example, in Australia) early in the LIG, and 
comparably high LSLs could have occurred closer to the former ice 
sheet (for example, in the Caribbean) late in the LIG’’. Without accurate 
and precise dating of the relevant sea level indicators and an appre- 
ciation of the difference between LSL and GSL, such patterns could 
Figure 1 | Sites with at least one sea level observation in our database. The
symbol shapes reflect the nature of the indicators (upward triangles, 
isotopic; circles, reef terraces; downward triangles, coral biofacies; squares, 
sedimentary facies and non-coral biofacies; diamonds, erosional). The 
colours reflect the number of observations at a site (blue, 1; green, 2; 
magenta, 3; red, 4 or more). 


produce the false appearance of a magnified or diminished GSL high- 
stand. In order to estimate ice sheet history from sea level records, it is 
thus necessary to account for physical factors like gravitation and solid 
Earth deformation. Conversely, because these effects cause LSL changes 
to differ with distance from an ice sheet, a global database of LSL 
indicators can potentially address not just whether global ice volume 
was smaller during the LIG than today, but also what combination of 
melting ice sheets, if any, was responsible for higher GSL. 

We construct a database of sea level indicators that is as compre- 
hensive as possible (Figs 1, 2; full data set available in Supplementary 
Information) and use it to estimate the posterior probability distri- 
bution of LSL as a function of space and time and of GSL and ice sheet 
volumes as functions of time. We must cope with variable
geochronological uncertainty, as well as with variable errors in sea levels
inferred from proxy data and in estimates of regional long-term
tectonic uplift or thermal subsidence. In addition, some of the data
provide only upper or lower bounds to sea level. Where possible, we
also want to take advantage of quasi-continuous sequences, in which
relative timing is known with greater precision than absolute dates.
These sequences include a stacked global oxygen isotope curve from
benthic foraminifera (ref. 20), as well as series of LSL measurements inferred
from sedimentary facies in the Netherlands and from hydrological
modelling of foraminiferal oxygen isotopes in the Red Sea. (These
series are described in detail in Supplementary Information.)


Statistical approach 

The ultimate goal of our analysis is to determine the posterior prob- 
ability distribution of LIG sea level and ice volume through time, 
conditioned upon the measurements in our database. Inherent in the 
method is the assumption that both the prior and posterior distribu- 
tions are multivariate Gaussian. 

Figure 2 | Localities at which LSL data exist in our database, for time slices
through the LIG. The diameter of each circle scales as indicated with the
probability that the corresponding data point occurs in the indicated
interval. The horizontal (vertical) lines are proportional to the standard
deviations of the age (sea level) measurements. The intersection of the lines
reflects the mean age estimate relative to the age window; a rightward skew
reflects a mean estimate earlier than the middle of the window. Data that
provide only upper or lower sea level bounds are indicated by downward and
upward triangles, respectively. Colours indicate the mean sea level estimate
in metres above present value. Some symbols overlap; for a complete table of
observations, see Supplementary Information.

We construct a prior probability distribution from the global oxygen
isotope curve and its associated age model (ref. 20), as described in detail in
Methods and Supplementary Information. To do this, we use a physical
model of LSL that calculates the eustatic, gravitational, deformational
and rotational effects of melting ice sheets. We estimate the
mean and covariance of the prior distribution by averaging the values
and covariances of the LSLs and of GSL obtained by running many
alternative ice sheet histories through a forward physical model. These
histories themselves are sampled from two underlying distributions: a
distribution for global ice volume over time based upon ref. 20 and a
distribution for individual ice sheet volumes conditioned upon global
ice volume. This latter distribution is based upon random perturbations
of a model of Last Glacial Maximum (LGM)-to-present ice sheet
volume with additional allowances made for ice sheets smaller than
their present volumes. To approximate thermosteric effects resulting
from changes in mean ocean temperature and salinity, we add two
Gaussian terms: a term independent of time and GSL with a mean of
0 m and a standard deviation of 2 m, and a term that varies with global
ice volume (−1.6 ± 0.6 m per 100 m equivalent sea level (e.s.l.) ice
sheet growth). The temporal covariance of these thermosteric terms
has an e-folding time of 2 kyr. The uncertainty within the thermosteric
terms is large enough to also accommodate small contributions from
other sources, such as small mountain glaciers present today but not
included in the LGM-to-present ice model.
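
The ensemble estimate of the prior mean and covariance described above can be sketched as follows. This is only an illustration under stated assumptions, not the authors' implementation: the function name `ensemble_prior` is hypothetical, and random numbers stand in for the output of the forward physical model. Only the run count of 250 is taken from the Methods.

```python
import numpy as np

def ensemble_prior(histories):
    """Estimate a Gaussian prior from an ensemble of simulated histories.

    histories: array of shape (n_runs, n_points); each row is one
    forward-model run evaluated at the same space-time points.
    Returns the sample mean vector and sample covariance matrix.
    """
    histories = np.asarray(histories, dtype=float)
    mean = histories.mean(axis=0)
    cov = np.cov(histories, rowvar=False)  # columns are variables
    return mean, cov

# Synthetic stand-in for forward-model output (hypothetical numbers).
rng = np.random.default_rng(0)
n_runs, n_points = 250, 5
synthetic_runs = rng.normal(loc=0.0, scale=2.0, size=(n_runs, n_points))

prior_mean, prior_cov = ensemble_prior(synthetic_runs)
```

With a finite ensemble the covariance estimate is noisy, which is presumably why the Methods describe an additional temporal taper.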

To construct the posterior distribution of sea level at any arbitrary 
point in space and time, we start with the simpler problem of estimating 
the posterior probability distribution of sea level at the points included 
in our database and then interpolate to calculate values at points not in 
our database. We employ a three-step Gibbs sampler to sample the
Bayesian network illustrated in Fig. 3.

In the first step, we calculate corrected measurements of LSLs (s) by 
adjusting the altitude of our proxy observations (z) for their deposi- 
tional settings (D), which account for the relationship between proxy 
altitudes and sea level elevation at the time of formation, and for the 
background regional uplift or subsidence. The former correction 
incorporates sedimentological and geomorphological knowledge, 
such as the fact that most coral observations in the database are of 
species that grow between 0 and 5 m below mean low tide level, as
well as information about local tidal range. The latter correction is
based upon an estimate of the regional uplift or subsidence rate (u)
and a sample from the posterior distribution of measurement ages (g).
In selecting or constructing uplift or subsidence rate estimates, we 
have avoided estimates from the literature that assume LIG sea level 
as a reference point. 
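
A minimal sketch of the first step, under assumed sign conventions that the text does not spell out: altitude z is positive upward, the depositional depth is positive downward, and the uplift rate is positive for uplift. The function name and the numerical values are hypothetical. The age is treated as fixed because, within one iteration of the sampler, it is a single draw.

```python
import math

def corrected_sea_level(z, z_sd, depth, depth_sd,
                        uplift_rate, uplift_sd, age_kyr):
    """Return (mean, sd) of past local sea level relative to present, in metres.

    z           : present altitude of the indicator (m, positive up)
    depth       : depth below sea level at which it formed (m, positive down)
    uplift_rate : regional uplift rate (m per kyr, positive for uplift)
    age_kyr     : sampled age of the indicator (kyr), treated as fixed
    """
    mean = z + depth - uplift_rate * age_kyr
    # Independent Gaussian errors add in quadrature.
    sd = math.sqrt(z_sd**2 + depth_sd**2 + (age_kyr * uplift_sd)**2)
    return mean, sd

# A coral growing 0-5 m below low tide, summarized by a Gaussian
# with the same mean and standard deviation as that uniform range.
coral_depth_mean = 2.5
coral_depth_sd = 5.0 / math.sqrt(12.0)

s_mean, s_sd = corrected_sea_level(
    z=3.0, z_sd=0.5,
    depth=coral_depth_mean, depth_sd=coral_depth_sd,
    uplift_rate=0.0, uplift_sd=0.01,
    age_kyr=125.0,
)
```

Even at a nominally stable site, an uplift-rate uncertainty of 0.01 m per kyr contributes more than a metre of uncertainty at LIG ages, which illustrates why the uplift term cannot be neglected.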

In the second step, we employ Gaussian process regression to 
estimate the true sea levels (f). Gaussian process regression, of
which the commonly used geospatial technique of kriging interpola- 
tion is a well-known example, treats a field (such as sea level) as a 
collection of random variables drawn from a multivariate Gaussian 
distribution. By specifying the covariance structure of the field, 
knowledge about the relevant physics affecting the process can be 
incorporated into the modelling without constraining it to fit a par- 
ticular forward model. 
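
To make the second step concrete, here is a minimal one-dimensional Gaussian process regression in time only. It is a sketch, not the model used in the analysis: the squared-exponential kernel, its amplitude and length scale, and the data values are all illustrative assumptions, whereas the analysis derives its covariance from runs of a physical model.

```python
import numpy as np

def sq_exp_kernel(x1, x2, amplitude=3.0, length=5.0):
    """Squared-exponential covariance (an assumed stand-in kernel)."""
    d = x1[:, None] - x2[None, :]
    return amplitude**2 * np.exp(-0.5 * (d / length) ** 2)

def gp_posterior(x_obs, y_obs, noise_sd, x_new, prior_mean=0.0):
    """Posterior mean and covariance of the field at x_new."""
    K = sq_exp_kernel(x_obs, x_obs) + np.diag(noise_sd**2)
    K_star = sq_exp_kernel(x_new, x_obs)
    K_new = sq_exp_kernel(x_new, x_new)
    alpha = np.linalg.solve(K, y_obs - prior_mean)
    mean = prior_mean + K_star @ alpha
    cov = K_new - K_star @ np.linalg.solve(K, K_star.T)
    return mean, cov

# Hypothetical observations: ages (kyr ago), sea levels (m), 1-sigma errors (m).
ages = np.array([130.0, 126.0, 122.0, 118.0])
levels = np.array([-5.0, 4.0, 7.0, 3.0])
noise = np.array([2.0, 1.5, 1.5, 2.0])

grid = np.linspace(116.0, 132.0, 9)
post_mean, post_cov = gp_posterior(ages, levels, noise, grid)
```

The posterior variance at every grid point is no larger than the prior variance, and it shrinks most near well-constrained observations.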

Figure 3 | Schematic illustration of the process used in our statistical
analysis. See text for details.

In the third step, we use the Metropolis-Hastings algorithm to
draw a new Markov chain Monte Carlo sample of the ages (g), based
upon the measured ages (t) and the current estimate of the true sea
levels (f). Repeating this sequence many times allows us to sample the
posterior probability distribution for LSL and GSL in a way that
satisfies the measurements to within their uncertainties.
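
The age-sampling step can be illustrated with a generic random-walk sampler. This sketch makes two simplifying assumptions: the proposal is symmetric (the Metropolis special case of Metropolis-Hastings), and the target depends only on a hypothetical measured age of 125 ± 2 kyr, whereas in the analysis the target also depends on the current sea level estimate (f). All names and numbers are illustrative.

```python
import numpy as np

def metropolis_hastings(log_target, start, step_sd, n_samples, rng):
    """Random-walk sampler with a symmetric Gaussian proposal."""
    samples = np.empty(n_samples)
    current = start
    current_lp = log_target(current)
    for i in range(n_samples):
        proposal = current + rng.normal(0.0, step_sd)
        proposal_lp = log_target(proposal)
        # Accept with probability min(1, target ratio).
        if np.log(rng.uniform()) < proposal_lp - current_lp:
            current, current_lp = proposal, proposal_lp
        samples[i] = current
    return samples

def log_age_density(g, measured_age=125.0, age_sd=2.0):
    """Unnormalized log density of the true age given the measured age."""
    return -0.5 * ((g - measured_age) / age_sd) ** 2

rng = np.random.default_rng(42)
age_samples = metropolis_hastings(
    log_age_density, start=120.0, step_sd=2.0, n_samples=20000, rng=rng
)
burned_in = age_samples[2000:]  # discard the initial transient
```

After burn-in the samples should scatter around the measured age with roughly its stated uncertainty.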

Equipped with an estimate of the posterior probability distribution,
we can then answer questions such as ‘what was the maximum GSL
attained during the LIG?’ and ‘what was the fastest rate at which GSL
rose when it was within 10 m of its present value?’ (As
discussed below, we focus on rates above the −10 m threshold
because the Laurentide Ice Sheet was comparable in size to the 
modern Greenland Ice Sheet by the time GSL rose to this level in 
the Holocene.) To answer such questions, we draw many samples 
from the posterior distribution and examine the distribution of 
answers based on these samples. We report these answers as excee- 
dance values. For instance, the 95% probability exceedance value of 
GSL is exceeded in 95% of all samples. If the 95% exceedance value is 
6.6 m, we can reject the hypothesis that sea level never exceeded 6.6 m 
at the 95% confidence level. Note that the answer to such questions is 
not identical to the answer one would get by looking at the median 
projection of GSL and reading its maximum; the maximum of the 
median would be the 50% probability exceedance value if all time 
points were perfectly correlated, but such is not the case. The median 
reconstruction instead represents the best estimate for GSL at each 
specific point in time, whereas the exceedance values are calculated 
across the entire LIG interval. 
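
The distinction between exceedance values and the maximum of the median curve can be checked numerically. In this sketch the posterior samples are synthetic (a smooth hypothetical trend plus independent noise at each time step), so the numbers carry no physical meaning; the function name is an assumption.

```python
import numpy as np

def exceedance_value(samples, probability):
    """Level exceeded by the per-sample maximum in the given fraction of samples.

    samples: array of shape (n_samples, n_times).
    """
    peaks = samples.max(axis=1)  # maximum over time within each sample
    return np.percentile(peaks, 100.0 * (1.0 - probability))

rng = np.random.default_rng(1)
n_samples, n_times = 5000, 40
time = np.linspace(0.0, 1.0, n_times)
trend = 7.0 * np.sin(np.pi * time)  # hypothetical highstand shape
gsl_samples = trend + rng.normal(0.0, 1.0, size=(n_samples, n_times))

v95 = exceedance_value(gsl_samples, 0.95)
v67 = exceedance_value(gsl_samples, 0.67)
v33 = exceedance_value(gsl_samples, 0.33)

max_of_median = np.median(gsl_samples, axis=0).max()
median_of_max = exceedance_value(gsl_samples, 0.50)
```

Because the time points are not perfectly correlated, `max_of_median` falls below `median_of_max`: each sample's peak can occur at a different time, so the peak of the median curve understates the typical peak.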


Results of global analysis 

Applying our algorithm to the full data set of LIG sea level indicators 
yields a GSL curve (Fig. 4a) with a median projection that peaks at
124 kyr ago at 7.2 ± 1.3 m (67% confidence interval). Further analysis
reveals a 95% probability of having exceeded 6.6 m at some time
during the LIG highstand and a 67% probability of having exceeded
8.0 m (Fig. 5, solid line). It is unlikely (33% probability) that GSL
exceeded 9.4 m.

To test the sensitivity of these results, we analysed seven subsets of 
the data: one subset excluding the Red Sea oxygen isotope curve, and 
six either excluding or including only (1) coral data, (2) erosional 
features, or (3) facies interpretations (Supplementary Information).

Figure 4 | Probability density plots of GSL and ice volume during the LIG.
a, Global sea level (GSL); b, 1,000-year average GSL rates; c, Northern
Hemisphere (NH) ice volume; and d, Southern Hemisphere (SH) ice volume.
Heavy lines mark median projections, dashed lines the 16th and 84th
percentiles, and dotted lines the 2.5th and 97.5th percentiles. Red crosses mark
median posterior estimates of sample ages. Vertical lines mark the interval
when >30% of samples from the distribution have standard deviations of GSL
<30% of the prior standard deviation (and are thus included in calculations of
exceedance probabilities). The horizontal line at 0 indicates modern values in
a, c and d and unchanging GSL in b. We urge caution in interpreting ice
volume projections (c, d) owing to the use of a Gaussian distribution to
represent a non-Gaussian prior. e.s.l., equivalent sea level.

Figure 5 | Exceedance values calculated from the posterior probability
distribution. The solid line shows GSL rise, the dashed line shows the 1,000-
year average rate of change of GSL when GSL is at or above −10 m, and the
dotted line shows ice loss in the hemisphere with the least ice loss.


The results from these subsets were fairly consistent. Across all subsets, 
the median projection peaked between 6.4 and 8.7 m. With the excep- 
tion of the subset containing only erosional features, the 95% prob- 
ability exceedance value ranged from 5.7 to 7.0 m, the 67% probability 
value ranged from 7.3 to 8.7 m, and the 33% probability value ranged 
from 8.4 to 10.5 m. (The values for the subset containing only erosional
features were slightly lower and more broadly spread, with 95%,
67% and 33% values of −0.3 m, 3.9 m and 6.8 m, respectively. The
spread reflects the relatively high uncertainty on this projection, which 
results in large part from a smaller data set.) We therefore consider our 
results to be reasonably robust with respect to different observations. 

The 95%, 67% and 33% probability exceedance values for 1,000-year 
average GSL rise rate during the interval when GSL was ≥ −10 m are
5.6 m kyr⁻¹, 7.4 m kyr⁻¹ and 9.2 m kyr⁻¹, respectively (Fig. 4b; Fig. 5,
dashed line). We emphasize that these values by no means exclude faster 
intervals of sea level rise lasting for less than one millennium. 

We can also attempt to answer questions about the magnitude of 
ice sheet volume based on the posterior probability distribution, but 
we must do so with caution. The distribution of Northern 
Hemisphere ice volume, in particular, can only be roughly approxi- 
mated with a Gaussian, as it has a hard upper bound set by the fact 
that there is only about 7 m e.s.l. of Northern Hemisphere ice avail- 
able to melt today. Because of this limitation, although we directly 
present the hemispheric ice volume posteriors in Fig. 4c, d, we make 
only one fairly conservative inference regarding ice sheet volumes. 
The posterior distribution suggests a 95% probability that both 
Northern Hemisphere ice sheets and Southern Hemisphere ice sheets 
reached minima at which they were at least 2.5 m e.s.l. smaller than 
today, although not necessarily at the same point in time (Fig. 5, 
dotted line). We can make no strong statement about which
hemisphere lost more ice; in 59% of samples, it
was the Southern Hemisphere and in 41% of samples, it was the 
Northern Hemisphere. Additional sea level proxies close to the ice 
sheets would help increase the precision of these estimates, as might a 
non-Gaussian model for the prior distribution. 


Comparison to previous estimates 


Previous estimates of LIG sea level, which were generally in the range 
4-6 m, were based on interpretations of LSL at a small number of 
localities. The Fourth Assessment Report of the IPCC highlighted
Hawaii and Bermuda; other authors also include observations
from the Bahamas, Western Australia and the Seychelles Islands.
All these localities are relatively tectonically stable and experience
only slow thermal subsidence, associated with the cooling of the
lithosphere. If one had to draw conclusions about GSL from a small
number of LSL measurements, these are reasonable sites at which to
look.

Other commonly considered localities, such as Barbados and the
Huon Peninsula, are rapidly uplifting. These sites have
advantages as relative sea level recorders, most notably that terraces 
recording sea levels below present are readily accessible. Assuming 
these sites have experienced a steady rate of uplift, they can help 
uncover sea level variations over fairly short timescales. However, 
they are poor sites from which to draw conclusions about absolute 
sea levels, as recovering this information requires a precise estimate of 
uplift rate. Because our method incorporates knowledge about the 
associated uncertainties, we can include both stable and uplifting 
sites into our analysis. 

To our knowledge, only one previous study, which used a fairly
limited set of observations, has attempted to account for the effects of 
glacial isostatic adjustment in drawing conclusions about GSL and 
ice volume from LIG sea level records. As that study demonstrated, 
understanding the influence of these effects is critical, as otherwise 
LSL highstands could easily be falsely interpreted as reflecting global 
highstands. Our statistical model uses the covariance between LSL
and GSL, derived from many runs of a forward physical model, to
account for the gravitational, deformational and rotational effects of 
the ice-ocean mass redistribution. Our results indicate that the 
apparent high GSL during the LIG is indeed real, though previously 
underestimated. 


Rates of sea level change 


Our results suggest that during the interval of the LIG when sea level 
was above −10 m, the rate of sea level rise, averaged over 1 kyr, was
very likely to have reached values of at least about 5.6 m kyr⁻¹ but was
unlikely to have exceeded 9.2 m kyr⁻¹. Our data do not permit us to
resolve confidently rates of sea level change over shorter periods of
time. Our inferences are consistent with estimates of the rate of the
contribution of Laurentide Ice Sheet meltwater to GSL during the
early Holocene; the Laurentide Ice Sheet contribution is estimated to
account for about 7 m kyr⁻¹ during the period when GSL climbed
above −10 m (ref. 34).

Ice volume during the late deglacial rise at the start of the LIG was 
only slightly larger than at present. The Laurentide Ice Sheet would 
have been a shrunken remnant of its once extensive mass or, perhaps,
two small remnants, one over Québec and Labrador and one over
eastern Nunavut and Baffin Island, as in the early Holocene. As
the Laurentide Ice Sheet was within a factor of two in size of the present 
Greenland Ice Sheet, its dynamics may have been analogous to those of 
the Greenland Ice Sheet. The results from the LIG suggest that, given a 
sufficient forcing, the present ice sheets could sustain a rate of GSL rise 
of about 56-92 cm per century for several centuries, with these rates 
potentially spiking to higher values for shorter periods. 


Discussion 


Although it is the approach most commonly taken when the LIG is 
used as an analogue for near-future warming, GSL and global ice 
volume cannot be accurately inferred by a qualitative examination 
of LSL at a handful of localities. Better control is afforded by a more 
thorough approach that combines, as we do, an extensive database of 
sea level indicators with a probabilistic assessment of their interpre- 
tive and geochronological errors. The results of our analysis support 
the common hypothesis that LIG GSL was above the current value, 
but contrary to previous estimates, we conclude that peak GSL was 
very likely to have exceeded 6.6 m and was likely to have been above 
8.0 m, though it is unlikely to have exceeded 9.4 m. 

The LIG was only slightly warmer than present, with polar tem- 
peratures similar to those expected under a low-end, ~2 °C warming 
scenario. Nonetheless, it appears to have been associated with
substantially smaller ice sheets than exist at present. Achieving GSL in
excess of 6.6 m higher than present is likely to have required major
melting of both the Greenland and the West Antarctic ice sheets, an
inference supported by our finding that both Northern and Southern
Hemisphere ice volumes are very likely to have shrunk by at least
2.5 m e.s.l. relative to today. Incorporating a large database of
palaeoclimatic constraints thus highlights the vulnerability of ice sheets to
even relatively low levels of sustained global warming.


METHODS SUMMARY 


We assembled our database, which includes observations from 42 localities, 
through an extensive literature search for indicators with best estimates of ages 
between 140 and 90 kyr ago. To each indicator we assigned a depth range of 
formation or deposition based upon geomorphological and sedimentological 
interpretation. See Methods and Supplementary Information for full details of 
the database, the statistical analysis algorithm, and the physical model used to 
generate the covariance function. 


Full Methods and any associated references are available in the online version of
the paper at www.nature.com/nature.

METHODS

Database of LIG sea level indicators. We characterize each LIG sea level
indicator (indexed by $i$) by five parameters: its geographical position ($r_i$), its
measured altitude with respect to mean tide level ($z_i$), its measured age ($t_i$), the range
of depths at which it might have formed ($D_i$), and the estimated local uplift or
subsidence rate ($u_i$). Some of the observations are censored, in that they provide
only an upper or lower bound to sea level. When more than one observation
comes from the same locality, we also record stratigraphic order and, where
available, estimates of the relative ages of observations. With the exception of
geographical position, each of these variables has uncertainties that we assume
follow a Gaussian distribution. For some values, including all depositional depth
ranges, uniform distributions between two limits $a$ and $b$ may be a better choice
than Gaussian ones. In these cases, we substitute a Gaussian distribution with the
same mean and standard deviation as the uniform distribution, that is, a mean of
$(a+b)/2$ and a standard deviation of $(b-a)/\sqrt{12}$. Depositional ranges $D_i$ are
thus replaced with Gaussian estimates $d_i$. The full database is supplied in
Supplementary Information.
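
The substitution of a Gaussian for a uniform depth range is a moment-matching step and can be written in a few lines. The function name is hypothetical; the 0-5 m range is the coral growth range quoted in the main text.

```python
import math

def gaussian_from_uniform(a, b):
    """Mean and standard deviation of a uniform distribution on [a, b]."""
    mean = 0.5 * (a + b)
    sd = (b - a) / math.sqrt(12.0)
    return mean, sd

# Corals that grow between 0 and 5 m below mean low tide level.
depth_mean, depth_sd = gaussian_from_uniform(0.0, 5.0)
```

For this range the matched Gaussian has a mean of 2.5 m and a standard deviation of about 1.44 m. Unlike the uniform range, it assigns some probability to depths outside 0-5 m.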

Prior distribution. We assume that sea level is a Gaussian process with a spatially
and temporally varying covariance described by the function $k(r_i, g_i; r_j, g_j)$. There
is no uncertainty on spatial location $r_i$, but the temporal variable is $g_i$, the model
age (see Fig. 3). We approximate $k$ by $\hat{k}$, which is produced by sampling alternative
histories from a forward model that incorporates the relevant physics. To
stabilize the estimate and reduce variability related to finite sample size, we
smooth $\hat{k}$ with a Gaussian temporal taper function:

$$k(r_i, g_i; r_j, g_j) = \hat{k}(r_i, g_i; r_j, g_j)\,\exp\left[-(g_i - g_j)^2/\tau^2\right],$$

as discussed in the Supplementary Information. To produce the results described
in the main text, we employed $\tau = 3$ kyr. Results from other values are shown
in Supplementary Information.
The prior probability distribution is based upon the age model of ref. 20, 
which places the start of the deglaciation at about 135 kyr ago and the start of 
the LIG highstand at about 127 kyr ago. For consistency, we have aligned the Red 
Sea and Dutch sequences against this record and excluded from the main analysis 
three observations from the Houtman Abrolhos Islands whose ages are
inconsistent with this model. There is, however, considerable disagreement 
among current age models. Reference 38 (adopted in ref. 22) places the start 
of the highstand at about 125 kyr ago, 2 kyr later than ref. 20, while ref. 39 places 
the start of the deglaciation at between 137 and 142 kyr ago, 2—7 kyr earlier. Our 
results do not attempt to address these differences, and should be viewed in the 
context of the ref. 20 timescale. 
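
The Gaussian temporal taper described above amounts to an element-wise product between the ensemble covariance and a function of age separation. The sketch below assumes the taper has the form exp[−(g_i − g_j)²/τ²] with τ = 3 kyr; the function name and the small example matrix are hypothetical.

```python
import numpy as np

def taper_covariance(k_hat, ages_kyr, tau_kyr=3.0):
    """Multiply an ensemble covariance by a Gaussian taper in age separation."""
    ages = np.asarray(ages_kyr, dtype=float)
    dg = ages[:, None] - ages[None, :]
    return k_hat * np.exp(-(dg / tau_kyr) ** 2)

# Hypothetical covariance (m^2) among three points 3 kyr apart in age.
ages = np.array([130.0, 127.0, 124.0])
k_hat = np.array([
    [4.0, 2.0, 1.0],
    [2.0, 4.0, 2.0],
    [1.0, 2.0, 4.0],
])
k = taper_covariance(k_hat, ages)
```

Variances on the diagonal are unchanged, while covariances between points widely separated in age are damped towards zero. The Gaussian taper is itself a positive semidefinite kernel, so by the Schur product theorem the tapered matrix remains a valid covariance.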
Physical model. The physical model is based on a gravitationally self-consistent
sea-level equation that extends earlier work to take exact account of shoreline
migration due to either local sea-level changes (which give rise to offlap or onlap)
or changes in the extent of grounded, marine-based ice. The calculations are
performed using a pseudo-spectral sea-level solver with a truncation at
spherical harmonic degree and order 256. The solver incorporates the feedback
on sea level of contemporaneous, load-induced perturbations in the Earth’s
rotation vector, where these perturbations are computed using the new ice-age
rotation theory of ref. 23. The sensitivity to Earth structure is embedded
within viscoelastic surface load and tidal Love numbers. We adopt spherically
symmetric, self-gravitating, Maxwell viscoelastic Earth models. The elastic and
density structure of these models is given by the seismic model PREM (ref. 43).
The viscosity profile is discretized into three layers: (1) an extremely
high viscosity (essentially elastic) lithospheric lid of thickness LT; (2) a uniform viscosity
from the base of the lithosphere to 670 km depth (that is, the sub-lithospheric
upper mantle), which we denote as $\nu_{UM}$; and (3) a uniform lower mantle viscosity
(that is, from 670 km depth to the core-mantle boundary), denoted by $\nu_{LM}$. We
consider a suite of 72 such Earth models generated by using the following
choices: LT = 70, 95 or 120 km; $\nu_{UM}$ = 0.3, 0.5, 0.8 or 1.0 $\times 10^{21}$ Pa s;
$\nu_{LM}$ = 2, 3, 5, 8, 10 or 20 $\times 10^{21}$ Pa s.
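
The count of 72 Earth models follows from taking every combination of the three parameter lists (3 × 4 × 6). A short enumeration confirms this; the dictionary keys are hypothetical names, and viscosities are stored as multiples of 10^21 Pa s, which is assumed here to be the intended unit.

```python
from itertools import product

lithosphere_km = [70, 95, 120]
upper_mantle_visc = [0.3, 0.5, 0.8, 1.0]   # assumed units of 1e21 Pa s
lower_mantle_visc = [2, 3, 5, 8, 10, 20]   # assumed units of 1e21 Pa s

earth_models = [
    {"LT_km": lt, "nu_UM": um, "nu_LM": lm}
    for lt, um, lm in product(lithosphere_km, upper_mantle_visc, lower_mantle_visc)
]
n_models = len(earth_models)
```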

As described in Supplementary Information, we generate an estimate of the 
prior sea level covariance k by running the model 250 times with different ice 
sheet histories and randomly selected viscosity profiles. From these runs, we 
compute the covariance among LSLs at evenly spaced points, GSLs and ice sheet 
volumes, as well as at the exact coordinates of the sites in our database, and we 
store the results as a lookup table. Total ice volume in the different ice sheet 
histories is sampled from a distribution based upon the ref. 20 global oxygen 
isotope curve. The ice volume of individual ice sheets is sampled from a prob- 
ability distribution for individual ice sheet volumes that is conditional upon total 
global ice volume. This latter distribution is constructed from random
perturbations of LGM-to-present ice models.

Effects of susceptibility variants may depend on the parent from which they are inherited. Although many associations between
sequence variants and human traits have been discovered through genome-wide associations, the impact of parental origin 
has largely been ignored. Here we show that for 38,167 Icelanders genotyped using single nucleotide polymorphism (SNP) 
chips, the parental origin of most alleles can be determined. For this we used a combination of genealogy and long-range 
phasing. We then focused on SNPs that associate with diseases and are within 500 kilobases of known imprinted genes. 
Seven independent SNP associations were examined. Five—one with breast cancer, one with basal-cell carcinoma and three 
with type 2 diabetes—have parental-origin-specific associations. These variants are located in two genomic regions, 11p15 
and 7q32, each harbouring a cluster of imprinted genes. Furthermore, we observed a novel association between the SNP 
rs2334499 at 11p15 and type 2 diabetes. Here the allele that confers risk when paternally inherited is protective when 
maternally transmitted. We identified a differentially methylated CTCF-binding site at 11p15 and demonstrated correlation
of rs2334499 with decreased methylation of that site.


The effect of sequence variants on phenotypes may depend on 
parental origin. The most obvious scheme, although not the only
one, is imprinting, in which the effect is limited to the allele inherited
from a parent of a specific sex. Despite this, most reports of genome- 
wide association studies have treated the paternal and maternal 
alleles as exchangeable. This is understandable, as the information 
required is often unavailable, but it reduces the power of such studies 
to discover some susceptibility variants and underestimates the 
effects of others, contributing to unexplained heritability. Here we 
describe a method that allows us to determine the parental origin of 
haplotypes systematically even when the parents of probands are not 
genotyped. We use the results to discover associations that exhibit 
parental-origin-specific effects. 


Determining parental origin 


Figure 1 | An example of determination of parental origin. In blue and red
are two phased haplotypes of a proband. Among other typed individuals, the
closest paternal relative known also to carry the blue haplotype is R1, a
cousin; the corresponding maternal relative is R2. For the red haplotype, a
maternal aunt (R3) carries the haplotype, and the closest known carrier on
the paternal side is R4. Because R1 is a closer relative than R2, and R3 is a
closer relative than R4, the blue and red haplotypes are probably paternally
and maternally inherited, respectively. The single-tile score (Methods)
supporting this assignment is 0.194.

Long-range phasing allows for accurate phasing of Icelandic samples
typed with Illumina BeadChips for regions up to 10 cM in length.
Two advances have been made since then: stitching and parental-origin
determination. Genome-wide, long-range phasing was
applied to overlapping tiles, each 6 cM in length, with 3-cM overlaps
between consecutive tiles. For each tile, we attempted to determine
the parental origins of the two phased haplotypes regardless of
whether the parents of the proband were chip-typed. Using the
Icelandic genealogy database, for each of the two haplotypes of a
proband a search was performed to identify, among those individuals
also known to carry the same haplotype, the closest relative on each of
the paternal and maternal sides (Fig. 1). Results for the two haplotypes
were combined into a robust single-tile score reflecting the
relative likelihood of the two possible parental-origin assignments
(with a score greater than zero supporting one assignment and a score
less than zero supporting the other assignment; see Methods for
details). We then tried to stitch the haplotypes from consecutive tiles
together on the basis of sharing at the overlapping region. Stitching
and parental-origin determination are complementary tasks.
Specifically, if parental origin is determined with high confidence
for one tile, the information can be propagated to other tiles through
stitching (Supplementary Fig. 1a). Conversely, in cases in which the
overlap between two adjacent tiles is homozygous for all SNPs, stitching
can still be accomplished if parental origins can be determined for
both tiles independently (Supplementary Fig. 1b). For haplotypes
derived by stitching, a contig score for parental origin is computed
by summing the individual single-tile scores.
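
The tiling scheme and the contig score described above can be sketched as follows. The function names are hypothetical, the handling of the final (possibly shifted) tile at the chromosome end is an assumption, and the example scores other than 0.194 are invented for illustration.

```python
def make_tiles(chrom_length_cm, tile_cm=6.0, step_cm=3.0):
    """Overlapping tiles of tile_cm, each starting step_cm after the previous one.

    The last tile is anchored to the chromosome end (an assumed convention).
    """
    tiles = []
    start = 0.0
    while start + tile_cm < chrom_length_cm:
        tiles.append((start, start + tile_cm))
        start += step_cm
    tiles.append((max(0.0, chrom_length_cm - tile_cm), chrom_length_cm))
    return tiles

def contig_score(single_tile_scores):
    """Contig score for a stitched haplotype: the sum of single-tile scores."""
    return sum(single_tile_scores)

tiles = make_tiles(15.0)
score = contig_score([0.194, 0.120, -0.030])
```

With 6-cM tiles stepped by 3 cM, consecutive tiles share a 3-cM overlap, which is the region used for stitching. Summing lets a confidently assigned tile outweigh a weakly contradictory neighbour.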

After filtering based on various quality and yield criteria, 289,658 
autosomal markers and 8,411 markers on chromosome X were used. 
Excluding those with no parent listed in the genealogy database or with 
a genotyping yield of less than 98%, 38,167 individuals, the majority 
typed with Illumina HumanHap300 or CNV370 BeadChips (Sup- 
plementary Information), were processed. For these individuals, 
97.8% of the heterozygous genotypes were long-range phased, and 
in 99.8% of these the parental origin was determined. Overall, 
3,841,331,873 heterozygous genotypes, or 97.7% of all heterozygous 
genotypes, had parental origin assigned. The data include 2,879 typed
trios. To evaluate the accuracy of our method empirically, a run was 
performed with the data for parents in these trios removed when 
determining parental origin. For 231,585,437 heterozygous genotypes 
in the probands/offspring, parental origin was determined both by our 
method and using the trio data directly, with 500,330 discrepancies, an 
error rate of 0.22%. Because the trios tested passed heritability checks 
in preprocessing, the error rate for individuals with fewer than two 
parents genotyped is probably higher. Nevertheless, the overall error 
rate is probably less than 0.4% (Supplementary Information). 
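
The quoted figures can be checked with simple arithmetic. The snippet below recomputes the trio-based error rate and the total marker count from the numbers given in the text; the variable names are illustrative.

```python
discrepancies = 500_330
compared_genotypes = 231_585_437
error_rate_percent = 100.0 * discrepancies / compared_genotypes

autosomal_markers = 289_658
x_markers = 8_411
total_markers = autosomal_markers + x_markers
```

The error rate rounds to 0.22%, and the marker total of 298,069 is the number of SNPs processed.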


Table 1 | Parental-origin-specific analyses of disease-susceptibility variants

Imprinting and disease association


Although many mechanisms can lead to parental-origin-specific 
association with a phenotype, sequence variants located close to 
imprinted genes are more likely to exhibit such behaviour a priori. 
Through two sources, ref. 3 and the Imprinted Gene Catalogue, we
found forty-eight genes known to be imprinted in humans 
(Supplementary Table 1). Selecting regions that fall within 500 kilo- 
bases (kb) of any of these genes (NCBI build 36 of the human genome 
assembly) amounts to approximately 1% of the genome. The 500-kb 
threshold was chosen because imprinted genes often occur in clusters 
and the imprinting status of genes close to known imprinted genes is 
often undetermined. It is also known that a sequence variant can 
directly affect the function of a gene located some distance away. 
Among the 298,069 SNPs we processed, 3,840 fall within these 
selected regions. 

By consulting the US National Institutes of Health Office of 
Population Studies catalogue of published genome-wide association 
studies (accessed 25 April 2009), we intersected reported SNP-
disease associations with P<5 X10 * with the selected regions 
(Supplementary Table 2). After further restriction to diseases for
which genome scans have been published based on Icelandic data, 
four associations remained. Three other SNP associations we were 
aware of that fall within the imprinted regions, one recently published 
for basal-cell carcinoma and two new type 2 diabetes (T2D) variants
discovered in the Diabetes Genetics Replication and Meta-analysis 
(DIAGRAM) sample set (unpublished data, DIAGRAM Consortium; 
Supplementary Information), were also examined. 


Association analysis 

For each disease-SNP association, five tests were performed (Table 1).
We performed a standard case-control test without taking parental
origin into account to provide a baseline. Then we performed a
case-control analysis separately for the paternally and maternally
inherited alleles. A 2-d.f. test was applied to evaluate the joint effect.
A multiplicative model was assumed for the two alleles, but the
magnitude and direction of the effect were allowed to differ. Finally, the
difference between the effects of the paternally and maternally
inherited alleles was directly tested by comparing their allele frequencies
within cases. The information for this test came mainly from the
counts of the two types of heterozygote within cases (Supplementary
Information).

Table 1

Each block is headed by: disease, SNP [alleles]*; NCBI build 36 position; M.

                           Standard test       Paternal allele§    Maternal allele§    2-d.f. test‖ Paternal vs maternal (case only)¶
N                   Fcon   OR     P‡           OR     P            OR     P            P            n12:n21   P

Breast cancer, rs3817198† [C/T]; C11 1,865,582; M = 34,909
1,803               0.303  1.04   0.36         1.17   0.0038       0.91   0.11         0.0040       437:339   6.2 × 10⁻⁴

Basal-cell carcinoma, rs157935 [T/G]; C7 130,236,093; M = 37,041
1,118               0.676  1.23   1.8 × 10⁻⁵   1.40   1.5 × 10⁻⁶   1.09   0.19         3.8 × 10⁻⁶   237:182   0.010

T2D, rs2237892 [C/T]; C11 2,796,327; M = 34,706
1,468 (discovery)   0.925  1.19   0.044        1.14   0.24         1.24   0.071        0.095        81:90     0.51
783 (replication)          1.08   0.43         0.87   0.30         1.43   0.024        0.050        35:59     0.014
2,251 (combined)           1.15   0.043        1.03   0.71         1.30   0.0084       0.027        116:149   0.054

T2D, rs231362† [C/T]; C11 2,648,047; M = 33,377
1,423 (discovery)   0.551  1.09   0.051        0.97   0.67         1.23   0.0010       0.0037       329:401   0.014
750 (replication)          1.10   0.073        1.00   0.99         1.22   0.011        0.037        158:191   0.098
2,173 (combined)           1.10   0.013        0.98   0.73         1.23   6.2 × 10⁻⁵   2.6 × 10⁻⁴   487:592   0.0032

T2D, rs4731702 [C/T]; C7 130,083,924; M = 34,706
1,468 (discovery)   0.439  1.15   0.0018       1.07   0.24         1.23   6.4 × 10⁻⁴   0.0013       335:374   0.17
783 (replication)          0.95   0.38         0.84   0.024        1.08   0.31         0.048        163:204   0.037
2,251 (combined)           1.08   0.039        0.99   0.79         1.17   0.0010       0.0041       498:578   0.022

T2D, rs2334499 [T/C]; C11 1,653,425; M = 34,706
1,468 (discovery)   0.412  1.11   0.017        1.41   4.3 × 10⁻⁹   0.87   0.020        3.5 × 10⁻⁹   437:276   7.0 × 10⁻⁹
783 (replication)          1.02   0.71         1.23   0.0055       0.84   0.023        0.0018       222:157   8.0 × 10⁻⁴
2,251 (combined)           1.08   0.034        1.35   4.7 × 10⁻¹⁰  0.86   0.0020       5.7 × 10⁻¹¹  659:433   4.1 × 10⁻¹¹

NCBI build 36 position is shown in terms of chromosome and base number. N, case sample size; M, control set size; Fcon, control frequency (frequency of the risk allele in controls).
* The first allele is the risk allele on the basis of analyses that do not take into account parent of origin.
† Imputed allele probabilities were used.
‡ Genomic control was applied (true for all P values shown).
§ The effect of the paternally inherited allele was tested by comparing the corresponding alleles in cases with those in controls. The effect of the maternally inherited allele was tested similarly.
‖ The test assumes a multiplicative effect for the paternally and maternally inherited alleles, but allows the effects to be different under the alternative hypothesis when the null hypothesis of no effect is tested.
¶ To test directly whether the paternally and maternally inherited alleles have different effects, their allele frequencies were compared within the cases. Information for this test was mainly captured by the counts of the two types of heterozygote: n12 denotes the number of cases who have inherited allele 1 from the father and allele 2 from the mother, and n21 denotes the number of cases who have inherited allele 2 from the father and allele 1 from the mother.
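
The case-only comparison of paternal and maternal effects draws mainly on the two heterozygote counts, n12 and n21. As a rough illustration, the sketch below applies a McNemar-type chi-square test to those counts. This is a simplified stand-in, not the likelihood-based procedure used in the study. The `inflation` argument is an assumption that mimics genomic control by dividing the statistic by an inflation factor; its value would have to come from a genome scan.

```python
import math


def chi2_1df_pvalue(chi2):
    """Upper-tail P value of a chi-square statistic with 1 degree of freedom."""
    return math.erfc(math.sqrt(chi2 / 2.0))


def heterozygote_test(n12, n21, inflation=1.0):
    """McNemar-type test of n12 versus n21 among heterozygous cases.

    n12: cases with allele 1 from the father and allele 2 from the mother.
    n21: cases with allele 2 from the father and allele 1 from the mother.
    inflation: assumed genomic-control factor (1.0 means no adjustment).
    """
    if n12 + n21 == 0 or inflation <= 0:
        raise ValueError("need at least one heterozygote and a positive inflation factor")
    chi2 = (n12 - n21) ** 2 / (n12 + n21)
    chi2 /= inflation  # genomic control shrinks the statistic
    return chi2, chi2_1df_pvalue(chi2)


# Breast cancer heterozygote counts from Table 1 (437:339), without adjustment
chi2, p = heterozygote_test(437, 339)
print(f"chi2 = {chi2:.2f}, P = {p:.1e}")
```

Without any adjustment, the counts 437:339 give a P value of the same order as the genomic-control-adjusted value in Table 1. An inflation factor above 1 moves the result in the conservative direction.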

Two of the seven associations examined, one with prostate cancer 
and another with coronary artery disease, did not exhibit parental- 
origin-specific effects (Supplementary Information and Supplemen- 
tary Table 3). The five associations that did are presented here. 
Breast cancer. Allele C of rs3817198 in the 11p15 region (Fig. 2) was 
reported® to be associated with breast cancer with an allelic odds ratio 
of OR = 1.07 (P = 3 × 10⁻⁹). This study included about 21,860 cases
and 22,578 controls, allowing this modest effect to achieve genome- 
wide significance. A study’ of 9,770 cases and 10,799 controls in the 
Cancer Genetic Markers of Susceptibility project reported odds ratios 
of 1.02 and 1.12 for heterozygous and homozygous carriers of the same 
variant, respectively. Using information in their supplementary 
material, we deduced a P value of 0.06. Marker rs3817198 is not on 
the Illumina chips used to type the majority of the Icelandic samples, 
but is included on the Illumina 1M BeadChips for which we have data 
on 124 trios. We used a single-track assay to type another 90 trios, 
giving a total of 214 trios with genotypes for rs3817198, which trans- 
lates to a training set of 856 haplotypes. Adapting the statistical model 
used by IMPUTE" allele probabilities of rs3817198 were calculated for 
individuals with phased and parental-origin-determined haplotypes 
for this region (Supplementary Information). With the imputation 
results for 1,803 cases and 34,909 controls (Table 1), the standard 
case-control test gave a non-significant odds ratio of 1.04 (P = 0.36).
However, when parental origin was taken into account, the paternally
inherited allele showed a significant association (OR = 1.17,
P = 0.0038). The direct test of parental-origin-specific effects that used
only the case data was even more significant (P = 6.2 × 10⁻⁴). This is
because the estimated effect of allele C when maternally inherited,
although not significant (P = 0.11), is protective (OR = 0.91).

Figure 2 | Chromosome 11p15 locus. Markers associated with T2D
(rs2334499, rs231362 and rs2237892) and breast cancer (rs3817198) are
indicated. The two regions containing clusters of imprinted genes are
shaded. The location of the CTCF-binding region studied (OREG0020670)
and gene annotations were taken from the University of California, Santa
Cruz, genome browser (http://genome.ucsc.edu/). Estimated recombination
rates, from the International HapMap Project
(http://hapmap.ncbi.nlm.nih.gov/), are plotted to reflect the linkage
disequilibrium structure in the region. Mb, megabase.

Basal-cell carcinoma. We recently identified association of allele T of
rs157935, located at 7q32 (Fig. 3), with basal-cell carcinoma
(OR = 1.23, P = 5.7 × 10⁻¹⁰)’. Limiting the analysis to samples for
which parental origin could be determined, the paternally inherited
allele was significantly associated with the disease (OR = 1.40,
P = 1.5 × 10⁻⁶), but the effect of the maternally inherited allele,
although it was in the same direction, was not significant
(OR = 1.09, P = 0.19; Table 1). Tested directly, the effects of the
paternally and maternally inherited alleles were significantly different
(P = 0.01).

Type 2 diabetes. Allele C of rs2237892 in the maternally expressed 
gene KCNQ1 was first observed to be associated with T2D in Asian
populations'"'*. The power to detect association in populations of 
European ancestry is low owing to the high frequency of the variant 
there (~93% compared with ~61% in Asians), but the association 
has nonetheless been replicated'”'”. In the T2D samples we have 
previously used in genome scans (Table 1) including 1,468 cases, 
none of the tests involving parental origin were significant for 
rs2237892. However, with the addition of another 783 patients,
giving a total of 2,251 cases (Supplementary Information), allele C 
was significantly associated with the disease (OR = 1.30, P = 0.0084) 
when maternally transmitted, whereas the results for the paternally 
inherited allele were flat (OR = 1.03, P= 0.71). 

Through a meta-analysis of eight T2D genome-wide scans of 
DIAGRAM sample sets with additional follow-up (Supplementary 
Information), allele C of rs231362 was shown to associate with the 
disease (OR = 1.08, P = 3 × 10⁻¹⁰). Marker rs231362 is also located
in KCNQ1 (Fig. 2), but it is not substantially correlated with
rs2237892 (correlation coefficient, r² = 0.002). Also, it is not on
any of the Illumina chips used. A training set of 912 haplotypes, 
created through single-track-assay genotyping of 228 trios, was used 
for imputation of rs231362 into the Icelandic samples. Using the 
imputed results, the standard case-control test gave an odds ratio 
of 1.10 (P = 0.013). The effect, however, appears to be limited to the
maternally inherited allele (OR = 1.23, P = 6.2 × 10⁻⁵).

Figure 3 | Chromosome 7q32 locus. Markers associated with T2D
(rs4731702 and rs972283 (r² = 1; HapMap CEU)) and basal-cell carcinoma
(rs157935) are indicated. The region containing the known imprinted genes
is shaded. Gene annotations were taken from the University of California,
Santa Cruz, genome browser. Estimated recombination rates (from
HapMap) are plotted to reflect the linkage disequilibrium structure in the
region.

Another association with T2D in DIAGRAM samples involves 
allele C of rs4731702 at 7q32 (OR = 1.07, P = 2 × 10⁻¹⁰; Fig. 3). In
our combined Icelandic samples, the association was again restricted 
to the maternally inherited allele (OR = 1.17, P= 0.0010; OR = 0.99, 
P= 0.79 for the paternally inherited allele). 

Evaluating the seven known susceptibility variants jointly (the five
highlighted above plus the two variants for prostate cancer and
coronary artery disease), the test for no parental-specific effect for 
all gave a P value of <5 X 10°. Also, an analysis of false-discovery 
rate’’ indicates that it is likely that at least four of the five highlighted 
variants have true parental-origin-specific effects (Supplementary 
Information). 
A new diabetes susceptibility variant. Properly evaluating the stati- 
stical significance of the susceptibility variants described above 
requires adjusting for relatedness of the participants using the method 
of genomic control’. This required us to perform genome scans for 
these diseases (Supplementary Table 4 gives parental-origin test results 
for established susceptibility variants located outside the selected 
regions). The T2D scan performed with the initial sample set 
(Supplementary Information and Supplementary Fig. 2) gave a striking
result (Table 1). Allele T of rs2334499, at 11p15 (Fig. 2), showed
such a weak association (OR = 1.11, P = 0.017) in the standard
case-control test that it does not stand out in a genome-wide scan. However,
taking into account parental origin, both the paternally inherited allele
(OR = 1.41, P = 4.3 × 10⁻⁹) and the 2-d.f. test (P = 3.5 × 10⁻⁹) were
genome-wide significant. Most notably, the maternally inherited allele
also showed nominally significant association, but the effect of allele T
was protective (OR = 0.87, P = 0.020). Tested directly, the difference
between the effects of the paternally and maternally inherited alleles
was also genome-wide significant (P = 7.0 × 10⁻⁹). This SNP falls
within 350 kb of a large cluster of imprinted genes, making the results
even more compelling. However, the observation that allele T is
protective when maternally inherited required replication. For this, we
used an additional set of 783 chip-typed T2D cases. All tests involving
parental origin were significantly replicated. For the combined analysis
of the two sample sets (Supplementary Information and Supplementary
Fig. 3), the paternally inherited allele had an odds ratio of 1.35
(P = 4.7 × 10⁻¹⁰) and the maternally inherited allele had an odds ratio
of 0.86 (P = 0.0020). The 2-d.f. test and the paternal-versus-maternal
test gave P values of 5.7 × 10⁻¹¹ and 4.1 × 10⁻¹¹, respectively.

As there are known examples in an imprinted setting where the 
paternal and maternal alleles interact'’, we tested rs2334499 for an 
interactive effect. This test was not significant (P > 0.4; Supplemen- 
tary Information) indicating that the multiplicative model provides 
an adequate fit. Specifically, in comparison with CT (first allele paternal, 
second allele maternal), CC, TT and TC have relative risks of 1.17, 1.35 
and 1.57, respectively. 
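
The relative risks quoted above follow from the multiplicative model: each genotype's risk is the product of a paternal-allele factor and a maternal-allele factor. The sketch below reproduces that arithmetic from the rounded combined odds ratios for allele T of rs2334499 (1.35 paternal, 0.86 maternal). The function name and its arguments are hypothetical, and odds ratios are treated here as relative risks, which is an approximation.

```python
from itertools import product


def genotype_relative_risks(or_paternal, or_maternal, risk="T", other="C",
                            reference=("C", "T")):
    """Relative risk of each (paternal, maternal) genotype under a multiplicative model.

    Genotype keys list the paternally inherited allele first.
    """
    def risk_of(paternal, maternal):
        pat = or_paternal if paternal == risk else 1.0
        mat = or_maternal if maternal == risk else 1.0
        return pat * mat

    ref = risk_of(*reference)
    return {p + m: risk_of(p, m) / ref for p, m in product((other, risk), repeat=2)}


rr = genotype_relative_risks(or_paternal=1.35, or_maternal=0.86)
for genotype, value in rr.items():
    print(genotype, round(value, 2))
```

With these rounded inputs the CC value comes out slightly below the 1.17 quoted in the text, presumably because the published figure was computed from unrounded estimates. TT and TC match at 1.35 and 1.57.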

The transmitted maternal allele has an effect in all four T2D 
variants in Table 1. Because prenatal maternal conditions may be a 
factor in conferring risk on the offspring, we examined the role of the 
non-transmitted maternal allele. No significant effect was observed 
(Supplementary Information). 


Imprinted regions at 11p15 and 7q32 


Imprinted genes at 11p15.5 fall into two clusters, H19/IGF2 and 
KCNQ1 (Fig. 2), regulated through separate imprinting control
regions, each of which controls expression of a number of genes 
within the cluster'®. The H19/IGF2 imprinting control region is regu- 
lated through a differentially methylated region that is normally 
methylated only on the paternal chromosome. Binding of the insu- 
lator protein CTCF in the imprinting control region is permitted only 
on the unmethylated maternal chromosome, resulting in expression 
of IGF2 only from the paternal methylated chromosome and expres- 
sion of H19 from the maternal chromosome’. The breast cancer 
paternally associated marker rs3817198 resides within LSP1, 100 kb 
downstream of H19 and within the same linkage disequilibrium
block. The effect of this marker on breast cancer could thus be 
through the H19/IGF2 imprinted locus. Loss of imprinting at the 
H19/IGF2 locus, resulting in activation of IGF2 expression, has been 
reported in a number of different tumour types'*. Furthermore, loss 
of imprinting at the H19/IGF2 locus in normal tissue has also been 
shown to indicate a predisposition to colorectal cancer". 

The KCNQ1 cluster is regulated through an imprinting control
region located in the promoter region of KCNQ1OT1, a paternally
expressed non-coding antisense RNA. Hypermethylation of the
maternal allele results in monoallelic activity of the neighbouring
maternally expressed protein-coding genes. The two T2D-associated
markers at this locus, rs231362 and rs2237892, are both located
within the maternally expressed KCNQ1, consistent with the risk
associations, rs231362 also residing within the KCNQ1OT1 antisense
transcript (Fig. 2).

Although both the T2D marker rs2334499 and the breast cancer 
marker rs3817198 fall within 350 kb of imprinted genes, the region 
harbouring them has not been reported to be imprinted"? (Fig. 2). 
Marker rs2334499 resides within the first intron of HCCA2, a gene 
which spans 300 kb containing several other genes (Fig. 2) including 
KRTAP5-1 to KRTAP5-6, DUSP8 and CTSD?°. To determine 
whether genes in this region showed signs of imprinting, we per- 
formed allele-specific expression analysis of HCCA2, CTSD and 
DUSP8 (Fig. 2), as well as three genes known to be imprinted in 
the 11p15.5 region (IGF2, KCNQ1 and KCNQ1OT1), in RNA iso-
lated from peripheral blood and adipose. Whereas allele-specific
expression of IGF2, KCNQ1 and KCNQ1OT1 was confirmed in this
data set, clear biallelic expression was seen for HCCA2 and DUSP8. 
However, excess paternal expression could not be ruled out for CTSD 
(Supplementary Information and Supplementary Table 6). 

The imprinted region at 7q32 consists of maternally expressed 
genes (CPA4 and KLF14) flanking paternally expressed genes 
(MEST and MESTIT1) (Fig. 3). The T2D-associated marker 
rs4731702 is located 14 kb from the maternally expressed KLF14
transcription factor*' and only increases risk of T2D when carried 
on the maternal chromosome. The basal-cell carcinoma variant 
rs157935, conferring risk through the paternal allele, is located 
170 kb telomeric to the imprinted region. 

We previously” correlated SNP genotypes from the Illumina 300K 
chip with gene expression using RNA samples from adipose tissue 
(N= 603) and peripheral blood (N= 745). Here, taking parental 
origin into account, we re-evaluated the correlation between the 
six variants in Table 1 and expression of genes at the 7q32 and 
11p15.5 loci. The T2D risk allele of rs4731702 at 7q32 correlated with 
lower expression of KLF14 in adipose tissue (P= 3 X 10 ~') when 
inherited maternally, but there was no effect when it was inherited 
paternally (Supplementary Table 7). Similar correlation was not seen 
in blood. Conversely, no strong correlation with parental-origin- 
specific gene expression was seen for the other disease-associated 
variants at 7q32 or 11p15.5 (Supplementary Table 7). 


Methylation of a novel CTCF-binding site 


Recent studies have mapped regions of CTCF-binding genome-wide 
for identification of insulator elements*’*. One of the sites identified 
(OREG0020670) is a 2-kb region located 17 kb centromeric to the 
T2D marker rs2334499 (Fig. 2 and Supplementary Fig. 4). We
assessed the methylation status of this CTCF-binding region in 
DNA samples derived from peripheral blood, using bisulphite 
sequencing. We identified a differentially methylated region of 180 
base pairs including seven CpG dinucleotides (Supplementary Fig. 4) 
where the ratio of 5-methyl cytosine (Cp) varied from around 0.1 to 
0.6. Methylation at five of the seven CpG dinucleotides (CpG-1 to 
CpG-5; Supplementary Fig. 4) was highly correlated (Supplementary 
Table 9). The estimated Cp ratio was tested for correlation with SNPs 
in a two-megabase surrounding region. The most significant correla- 
tion was observed between methylation status at CpG-4 and 

Table 2 | Correlation between methylation of a CTCF-binding region and
the T2D risk-variant rs2334499


CpG Percentage methylation Effect† P‡
dinucleotide (mean, s.e.)*

CpG-1 22.9 (0.9) —5.7 65X10” 
CpG-2 15.3 (0.7) =k, 0.00055 
CpG-3 13.3 (0.8) −2.5 0.017
CpG-4 56.7 (0.9) −8.4 2.6 × 10⁻¹³
CpG-5 34.9 (0.9) -6.7 6.8x 10° 
CpG-6 22.2 (0.6) —1.0 0.24 
CpG-7 52.9 (1.1) = 0.30 


* Mean and standard error of the C/T allele ratio estimated by bisulphite sequencing of 168 
individuals. 

† Change in percentage methylation per allele T of rs2334499 carried.

‡ Significance of the correlation between methylation and rs2334499.


rs2334499, for which P = 2.6 × 10⁻¹³ (Table 2). Furthermore, cor-
relation between rs2334499 and methylation of CpG-1 to CpG-5 was
significant. For these five CpG dinucleotides, the T2D risk allele 
correlated with decreased methylation and this effect was observed 
regardless of whether the allele was inherited from the father or the 
mother. By contrast, neither the breast cancer variant nor the two 
other T2D markers at 11p15.5 showed any correlation with the 
methylation status of this CTCF-binding site. 


Discussion 


Being able to determine parental origin of alleles and haplotypes in 
large samples opens new avenues to study associations between 
sequence variants and human traits. Standard association analysis pro- 
vides suboptimal power to discover disease susceptibility variants that 
exhibit parental-origin-specific effects. Even when association can be 
established, the true effect is underestimated. Marker rs2334499 did 
not gain serious attention even after the large collaborative effort of the 
DIAGRAM Consortium. However, its contribution to T2D, measured 
by the recurrent risks of siblings generated, is second only to that of the 
TCF7L2 variant among the known susceptibility variants (Supplemen- 
tary Information and Supplementary Fig. 2). Sequence variants, such 
as rs2334499, that can confer both risk and protection depending on
parental origin can lead to balanced selection and as a result promote 
diversity. 

Functional imprinting is extremely tissue and stage specific, and 
although some genes retain their imprinted status throughout life, 
the main role of imprinting is believed to be during prenatal growth 
and development. However, the associations of rs4731702 C with 
T2D and KLF14 expression in adult adipose tissue, in both cases only 
when maternally inherited, strongly implicate this transcription
factor as the disease gene. 

We searched for evidence of epigenetic marks around the T2D risk 
variant rs2334499, as it is located some distance away from the estab- 
lished 11p15.5 imprinted genes. A CTCF-binding site in the region 
was found to be differentially methylated and the rs2334499 risk allele 
was shown to be correlated with decreased methylation. Given the 
well-established role of CTCF in imprinting, this new site could 
differentially influence the dosage of imprinted genes on the two 
parental chromosomes. 

Despite their successes, genome-wide association studies have so 
far yielded sequence variants that explain only a small fraction of the 
estimated heritability of most of the human traits studied. Obvious 
contributors to the unexplained heritability, or ‘dark matter’, include 
rare variants not well tagged by common SNPs and common variants 
that have very small effects individually. Results presented here 
demonstrate that a portion of the heritability of some common/ 
complex traits is hidden in more complex relations between sequence 
variants and the risks of these variants. 


METHODS SUMMARY 

Subjects. We used 38,167 Icelandic individuals who were genotyped using an 
Illumina SNP chip and processed for long-range phasing. See Supplementary 
Information for details of disease and control groups. 

Genotyping. We performed genome-wide genotyping using various Illumina
BeadChips. Individual genotyping of two SNPs was done using Centaurus 
assays. 

Determination of parental origin. We used the Icelandic genealogy database to 
identify the closest relatives who shared a haplotype with the proband. Parental 
origin was then assigned to the two haplotypes of a proband on the basis of a 
computed score (Methods). 

Data imputation. On the basis of a training set of trios, by adapting the statistical
model used by IMPUTE” to our setting, we computed allele probabilities of the 
paternal and maternal chromosomes separately for samples for which an SNP 
was not genotyped. 

Statistical analysis. For SNPs directly typed, we used likelihood-based proce- 
dures to study disease associations taking parental origin into account. For 
imputed SNPs, we used logistic regressions and t-tests. Genomic control was 
used to control for relatedness among subjects. 

Methylation analysis. Bisulphite sequencing was used to estimate the level of 
methylation. For each CpG dinucleotide, we determined the methylation status 
by calculating the C/T allele ratio at that site. 
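
As an illustration of the two quantities involved, the sketch below computes a methylation level from C and T signals at one CpG site and a per-allele effect by least-squares regression on the number of T alleles carried. It rests on assumptions not stated in the text: the "C/T allele ratio" is taken to mean C / (C + T), and the per-allele effect is taken to be an ordinary regression slope. All input numbers are hypothetical.

```python
def methylation_percent(c_signal, t_signal):
    """Percentage methylation at a CpG site, assumed here to be C / (C + T)."""
    total = c_signal + t_signal
    if total <= 0:
        raise ValueError("no signal at this site")
    return 100.0 * c_signal / total


def per_allele_effect(allele_counts, methylation):
    """Least-squares slope: change in percentage methylation per risk allele carried."""
    n = len(allele_counts)
    if n != len(methylation) or n < 2:
        raise ValueError("need two equal-length sequences with at least two entries")
    mean_x = sum(allele_counts) / n
    mean_y = sum(methylation) / n
    sxx = sum((x - mean_x) ** 2 for x in allele_counts)
    if sxx == 0:
        raise ValueError("allele counts do not vary")
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(allele_counts, methylation))
    return sxy / sxx


# Hypothetical individuals: number of T alleles carried and percentage methylation
counts = [0, 1, 2, 1, 0, 2]
levels = [60.0, 52.0, 44.0, 50.0, 58.0, 46.0]
print(methylation_percent(30.0, 70.0))
print(per_allele_effect(counts, levels))
```

A negative slope corresponds to the pattern reported in Table 2, in which the risk allele correlates with decreased methylation.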

Gene expression. We tested associated SNPs for correlation with expression of 
genes located in a one-megabase window centred on the variant, in a data set of 
expression measurements in whole blood and adipose tissue. The same expres- 
sion library was used to determine parental origin of expression by using allele- 
specific probes where the parental origin of each allele was known. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 

METHODS

Assignment of parental origin. Let H be a haplotype for a tile T. For a particular 
proband, f(T, H) and m(T, H) were calculated as the meiotic distances to the closest 
relatives on the paternal side and, respectively, the maternal side known to carry H. 
Descendants of the parents of the proband, for example siblings of the proband, 
were excluded from this calculation. Also, a value of 10,000 was assigned when no
relative carrying the haplotype was found. Let A and B be the two phased haplo-
types of the proband. The single-tile score for parental origin was calculated as

$$
\begin{aligned}
\mathrm{score}(T,A,B) &= \mathrm{score}(T,A) - \mathrm{score}(T,B) \\
&= \left[\log\left(1-2^{-m(T,A)}\right) - \log\left(1-2^{-f(T,A)}\right)\right] \\
&\quad - \left[\log\left(1-2^{-m(T,B)}\right) - \log\left(1-2^{-f(T,B)}\right)\right]
\end{aligned}
$$

A score that is greater than zero supports the assignment of A as the paternally
inherited haplotype and B as the maternally inherited haplotype, whereas a score
that is less than zero supports the reverse. Although it is not meant to be optimal in
any formal sense, this system of scoring was chosen to have two properties. First, for
the same absolute difference between m(T, H) and f(T, H), the absolute value of 
score(T, H) is higher when the lesser of m(T, H) and f(T, H) is smaller, thus giving
more weight to situations in which a close relative who shared a haplotype is found. 
Second, the scoring was designed such that the result from one haplotype in one tile 
could not completely dominate the contributions from other haplotypes and adja- 
cent tiles when results were combined (see below). 

When haplotypes for n consecutive tiles, $T_1, \ldots, T_n$, could be stitched together
to form $A = (A_1, \ldots, A_n)$ and $B = (B_1, \ldots, B_n)$, the contig score for parental origin
assignment was calculated as

$$
\text{contig-score}(T_1,\ldots,T_n) = \sum_{i=1}^{n} \mathrm{score}(T_i, A_i, B_i)
$$

Parental origins were assigned on the basis of whether the contig score was
greater than or less than zero. The accuracy of this procedure was evaluated
using the trio test.
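
The scoring scheme can be sketched in a few lines of Python. The sketch follows the sign convention stated in the text (a positive score supports A as the paternally inherited haplotype) and assumes natural logarithms, which the text does not specify. The function names and the example distances are hypothetical.

```python
import math

NO_RELATIVE = 10_000  # distance assigned in the text when no carrier relative is found


def log_term(distance):
    """Return log(1 - 2**-distance) for a meiotic distance of at least 1."""
    if distance < 1:
        raise ValueError("meiotic distance must be at least 1")
    # 2**-distance is negligible for large distances; treat it as exactly zero
    p = 0.0 if distance > 1000 else 2.0 ** (-distance)
    return math.log1p(-p)


def haplotype_score(f, m):
    """score(T, H): positive when the closest carrier is on the paternal side."""
    return log_term(m) - log_term(f)


def tile_score(f_a, m_a, f_b, m_b):
    """score(T, A, B) = score(T, A) - score(T, B)."""
    return haplotype_score(f_a, m_a) - haplotype_score(f_b, m_b)


def contig_score(tiles):
    """Sum of single-tile scores over consecutive tiles (f_a, m_a, f_b, m_b)."""
    return sum(tile_score(*tile) for tile in tiles)


def assign_parental_origin(tiles):
    total = contig_score(tiles)
    if total > 0:
        return "A paternal, B maternal"
    if total < 0:
        return "A maternal, B paternal"
    return "undetermined"


# Hypothetical contig of two tiles
tiles = [
    (3, NO_RELATIVE, NO_RELATIVE, 4),            # A seen on father's side, B on mother's
    (NO_RELATIVE, NO_RELATIVE, NO_RELATIVE, 2),  # only B seen, on mother's side
]
print(contig_score(tiles), assign_parental_origin(tiles))
```

For distances of at least 1, each log term lies between log(0.5) and 0, so no single haplotype or tile can dominate the sum. A small distance contributes far more than a large one, in line with the two properties described in the text.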


Growth landscape formed by perception
and import of glucose in yeast 


Hyun Youk' & Alexander van Oudenaarden’” 


An important challenge in systems biology is to quantitatively describe microbial growth using a few measurable parameters
that capture the essence of this complex phenomenon. Two key events at the cell membrane—extracellular glucose sensing 
and uptake—initiate the budding yeast's growth on glucose. However, conventional growth models focus almost exclusively 
on glucose uptake. Here we present results from growth-rate experiments that cannot be explained by focusing on glucose 
uptake alone. By imposing a glucose uptake rate independent of the sensed extracellular glucose level, we show that despite 
increasing both the sensed glucose concentration and uptake rate, the cell's growth rate can decrease or even approach zero. 
We resolve this puzzle by showing that the interaction between glucose perception and import, not their individual actions, 
determines the central features of growth, and characterize this interaction using a quantitative model. Disrupting this 
interaction by knocking out two key glucose sensors significantly changes the cell's growth rate, yet uptake rates are 
unchanged. This is due to a decrease in burden that glucose perception places on the cells. Our work shows that glucose 
perception and import are separate and pivotal modules of yeast growth, the interaction of which can be precisely tuned and
measured.


In 1942 Jacques Monod introduced his microbial growth model’ that 
prompted quantitative studies of microbial metabolism”. This 
motivated a wealth of mathematical models describing the growth 
of budding yeast Saccharomyces cerevisiae on the key carbohydrate 
glucose'*. These models mainly focus on the effect of glucose import 
on the growth rate. However, in addition to importing glucose, yeast 
senses extracellular glucose through several glucose sensors. These 
two key events at the cell membrane—glucose sensing and import— 
then trigger many downstream intracellular molecular events (for 
example, transcription, metabolic processes, post-transcriptional 
modifications) that collectively determine the growth rate’’. Many 
conventional models overlook this collective effect by ignoring glu- 
cose sensing. Growth behaviours that are qualitatively very different 
from current models’ descriptions may arise if glucose sensing and 
import are properly taken into account. One approach to addressing 
this deficiency is constructing detailed many-parameter models that 
attempt to explicitly track each of the vast molecular events involved 
in yeast’s glucose metabolism*°. Such an approach has provided 
detailed information on the flux of thousands of known metabolic 
reactions and new insights into yeast’s growth on glucose. However, 
it also combines the effects of glucose sensing and import because it is 
not yet known how each of the vast molecular events are altered when 
glucose import rate is varied independently of the level of extracel- 
lular glucose sensed by the cell. The enormous number of metabolites 
and reactions involved makes experimentally determining each 
molecular change due to glucose sensing and import challenging. 
Indeed, a persistent challenge in obtaining a quantitative understand- 
ing of microbial growth on nutrients has been identifying just the few 
parameters that are necessary for extracting the central features from 
this complex cellular process. A phenomenological model that 
retains just those essential parameters may provide new insights 
and central design principles’®'’ underlying microbial growth. 
Motivated by these considerations, we sought to decouple and mea-
sure the separate effects of glucose sensing and import on cell growth,
and then to provide a concise phenomenological model that describes
how the interaction between the two determines the growth rate.


Dependence of growth rate on glucose level 


To measure and separate out the effects of glucose perception and 
import on growth rate, we first decouple any control that glucose 
sensing has on glucose import. Such coupling primarily comes from 
the two glucose sensors (Snf3 and Rgt2)'* that drive the transcrip-
tional regulation of the six primary hexose transporters (Hxt1–4, 6
and 7)'?” that are responsible for glucose import (Supplementary
Fig. 1). Our background strain lacks all the major and minor glucose
transporter genes (hxt1-17Δ, agt1Δ, stl1Δ, gal2Δ)**, thus no sensors
affect the transcription level of any transporter genes including the
HXT genes. We made five ‘single-HXT’ strains by introducing one of
the five primary HXT genes (excluding HXT7) into the background
strain, under the control of the inducible promoter P_TETO7. Each
strain contains just one type of HXT gene, and its expression level 
could be controlled by the inducer doxycycline independently of 
extracellular glucose (Supplementary Fig. 2). 

We measured the log-phase growth rate of the single-HXT strains
in minimal media containing a range of different concentrations of 
doxycycline and glucose, the concentrations of which were held con- 
stant during batch growth for each experiment. We found surprising 
behaviours in the growth rate of each single-HXT strain (Fig. 1 and 
Supplementary Fig. 3). Because glucose no longer regulates the tran- 
scription of the sole HXT gene in our strains in a complicated 
manner, one would expect that an increase in extracellular glucose 
concentration would lead to a simple increase in the single-HXT 
strain’s glucose uptake rate (when the doxycycline concentration is 
held constant). A typical conventional model" predicts that the 
growth rate should thus simply rise as the glucose level increases. 
Yet, depending on the initial glucose level, a further increase either 
increases or decreases the ‘Hxt1-only’ strain’s growth rate (Fig. 1).
This is also true for the growth rates of the Hxt2-only and Hxt4-only 
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Figure 1| Growth rates of single-HXT strains do not show any systematic 
trend with respect to glucose concentration. Log-phase growth rates of the 
wild-type strain (CEN.PK2-1C) and five single-HXT strains at varying 
[glucose] but constant [doxycycline] (0 ug ml”! for wild-type and 

2.5 ug ml | for single-HXT strains) are shown. The shape of each single- 
HXT strain’s growth-rate curve is maintained over a wide range of 
doxycycline concentrations (Supplementary Fig. 3). The growth-rate curves 
of the single-HXT strains show stark differences from the wild-type’s curve: 
single-HXT strains’ growth rates can substantially decrease, and some 
strains even approach growth arrest, despite a monotonic increase in 
[glucose]. Error bars, s.e.m.; 1 = 3. 


strains. Furthermore, despite growing as well as other strains at low
glucose levels, the Hxt3-only and Hxt6-only strains even approach
growth arrest at glucose levels higher than 0.02% (Fig. 1). Thus, we
observed no systematic relationship between glucose level and
growth rate. It is noteworthy that the wild-type strain, unlike these
single-HXT strains, simply grows faster when more glucose is present
(Fig. 1), a behaviour we will consider more closely later.
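
For contrast, the conventional expectation of a growth rate that simply rises with glucose can be sketched with a Monod-type saturation curve. This is only an illustration: the Monod form is assumed here as a representative conventional model, and `mu_max` and `k_s` are placeholder values, not parameters from this study.

```python
def monod_growth_rate(glucose, mu_max=0.45, k_s=0.05):
    """Monod-type growth rate (per hour) at a glucose level given in % w/vol.

    mu_max (per hour) and k_s (% w/vol) are illustrative placeholders,
    not values reported in the study.
    """
    return mu_max * glucose / (k_s + glucose)


# Glucose levels spanning roughly the range shown on the figure axes (% w/vol).
glucose_levels = [0.001, 0.01, 0.1, 1.0, 10.0]
rates = [monod_growth_rate(g) for g in glucose_levels]

for g, mu in zip(glucose_levels, rates):
    print(f"[glucose] = {g:g} % w/vol -> growth rate = {mu:.3f} per hour")
```

Under such a model the growth rate can only increase with glucose, which is the behaviour of the wild-type strain but not of the single-HXT strains.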


Dependence of growth rate on glucose uptake rate 


Using our doxycycline-inducible expression system, we were able to 
show that for every single-HXT strain at fixed doxycycline level, the 
glucose uptake rate increased as the glucose level increased (Fig. 2). To 
measure glucose uptake rates, we fused yeast-enhanced green fluor- 
escent protein (yEGFP) to the inducible HXT gene in each of the 
single-HXT strains (Supplementary Fig. 4). Measuring the average 
single-cell fluorescence in these strains gave us the relative number 
of Hxt proteins synthesized in these cells (Supplementary Fig. 5). 
Using the known Michaelis-Menten parameters of the Hxts**”°, we 
calculated the cell’s total glucose uptake rate. We also directly mea- 
sured the cell’s glucose uptake rate. The directly measured and calcu- 
lated uptake rates were in good agreement (Supplementary Fig. 6): 
glucose uptake rate increased as the glucose concentration increased 
(at constant doxycycline concentration) (Fig. 2 and Supplementary 
Fig. 7). Hence, despite a monotonic increase in both glucose uptake 
rate and extracellular glucose level, single-HXT strains at fixed doxy- 
cycline concentration can grow significantly faster, or slower, or even 
approach growth arrest as seen earlier (Fig. 1)—effects that no con- 
ventional growth model can either quantitatively or qualitatively 
describe. 
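
The calculated uptake rate described above can be sketched as follows, assuming each transporter obeys Michaelis-Menten kinetics and that the fluorescence measurement has already been converted to a transporter count. The function name and all numerical values are hypothetical placeholders, not the published Hxt parameters.

```python
def uptake_rate(glucose_mM, n_transporters, v_max, k_m):
    """Total glucose uptake rate of one cell (molecules per second).

    glucose_mM      extracellular glucose concentration (mM)
    n_transporters  number of Hxt proteins in the cell
    v_max           maximal rate per transporter (molecules per second)
    k_m             Michaelis constant of the transporter (mM)
    """
    return n_transporters * v_max * glucose_mM / (k_m + glucose_mM)


# Placeholder parameters for one hypothetical transporter type.
N_HXT, V_MAX, K_M = 5.0e4, 200.0, 100.0

concentrations = [0.1, 1.0, 10.0, 100.0, 1000.0]  # mM
rates = [uptake_rate(g, N_HXT, V_MAX, K_M) for g in concentrations]

for g, r in zip(concentrations, rates):
    print(f"[glucose] = {g:g} mM -> uptake = {r:.3g} molecules per second")
```

With the transporter number held fixed (constant doxycycline), the computed uptake rate rises monotonically with glucose and saturates at `n_transporters * v_max`.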


Phenomenological model of growth 

Plotting all five single-HXT strains’ growth rates and uptake rates 
together resulted in a wide scatter of data points, in which each data 
point is specified by two coordinates: uptake rate and growth rate 
(Fig. 3a and Supplementary Fig. 8). This plot shows that uptake rate 
alone cannot specify the cell’s growth rate. Specifying the glucose 
concentration by colour-coding these data points (that is, each data 


NATURE|Vol 462|17 December 2009 


[Figure 2 plot: glucose uptake rate (molecules s⁻¹, logarithmic axis from 10⁵ to 10⁸) against [glucose] (% w/vol, logarithmic axis from 10⁻³ to 10).]


Figure 2 | A rise in [glucose] produces an increase in the uptake rate, but 
cells do not necessarily grow faster. To both measure and calculate glucose 
uptake rates, yEGFP was fused to the HXT gene in each single-HXT strain. 
These fluorescent single-HXT strains have the same growth-rate features as 
their non-fluorescent counterparts shown in Fig. 1 (Supplementary Fig. 4). 
The measured glucose uptake rates per cell for just three of these fluorescent 
single-HXT strains at [doxycycline] = 2.5 µg ml⁻¹ are shown here. These
fluorescent single-HXT strains’ glucose uptake rates monotonically increase
as [glucose] increases, despite the non-systematic behaviour of their growth
rates reflected in Fig. 1. Hence, a cell can grow faster, or slower, or approach
growth arrest despite an increase in both its glucose uptake rate and
[glucose]. Error bars, s.e.m.; n = 3.


point now has three coordinates: uptake rate, extracellular glucose
concentration, growth rate) causes a notable pattern to emerge
(Fig. 3b). This analysis shows that growth rate µ is determined by
two independent variables: the glucose uptake rate r, and the
extracellular glucose concentration g. Our full experimental data set of all
five single-HXT strains over a wide range of glucose and doxycycline
concentrations is described by a single equation


\[ \mu(r, g) = P(g) \ln\left(\frac{r}{r_c}\right) + \mu_c \qquad (1) \]

in which µ_c and r_c are constants specifying the point of convergence of
the log-linear lines (Fig. 3b), and the function P(g) describes the slope
of the log-linear correlation between µ and r for each value of g. This
equation does not depend on which Hxt transporter the cell uses for
glucose uptake. This slope P(g) increases with increasing g, and in
turn tends to decrease growth rate (when r < r_c). P(g) quantifies the
marked effect that the extracellular glucose has on growth rate
independently of glucose import: the effect of glucose perception.
Qualitatively, equation (1) states that an increase in the extracellular 
glucose concentration may cause two counteracting effects: an 
increased glucose uptake rate r (which tends to increase growth rate), 
and an increased perception of extracellular glucose (which tends to 
decrease growth rate). The net result on growth rate (that is, whether 
it rises or falls) is decided by the competition between these opposing 
effects of glucose perception and uptake. Which one of the two effects 
dominates depends on the actual values of g and r, in particular on the
product P(g)ln(r/r_c) quantifying the interaction between glucose
perception and import (Supplementary Text).
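
The competition between perception and uptake can be illustrated numerically with equation (1), written here as mu = P(g) ln(r / r_c) + mu_c. This is a minimal sketch: `MU_C` and `R_C` follow the constants quoted in the Fig. 3 caption (with the exponent of r_c read as 10^7, an assumption), while the two slopes `P_LOW` and `P_HIGH` and the uptake rates are hypothetical, since the fitted P(g) values are not listed in the text.

```python
import math

MU_C = 0.44   # per hour; convergence growth rate as quoted in the Fig. 3 caption
R_C = 1.4e7   # molecules per second; exponent assumed to be 7


def growth_rate(r, slope):
    """Equation (1): mu = P(g) * ln(r / r_c) + mu_c, with slope standing for P(g)."""
    return slope * math.log(r / R_C) + MU_C


# Hypothetical slopes P(g) at a low and at a higher glucose level (per hour).
P_LOW, P_HIGH = 0.05, 0.20

mu_start = growth_rate(2.0e6, P_LOW)        # low glucose, low uptake rate
mu_small_gain = growth_rate(3.0e6, P_HIGH)  # more glucose, uptake rises only a little
mu_large_gain = growth_rate(1.2e7, P_HIGH)  # more glucose, uptake rises a lot

print(f"start:             {mu_start:.3f} per hour")
print(f"small uptake gain: {mu_small_gain:.3f} per hour")
print(f"large uptake gain: {mu_large_gain:.3f} per hour")
```

With these placeholder numbers, raising glucose lowers the growth rate when uptake increases only slightly and raises it when uptake increases substantially, both below r_c.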

The ‘growth landscape’ in Fig. 3c, described by equation (1), shows
the full set of growth rates possible for a wide range of g and r. Because
equation (1) does not distinguish between the type and number of
Hxt transporters that cells use for glucose import, it is applicable to cells
with any number of HXT genes, including the wild-type, as long as the cells
achieve an uptake rate within the range we probed. The shape of this
landscape allows for the unusual growth-rate behaviours observed,

Figure 3 | Emergence of a concise growth model incorporating the cell’s
perception and uptake rate of glucose, and the resulting growth landscape.
a, b, Plotting together all of the measured growth rates and glucose uptake
rates of the fluorescent single-HXT strains (a) then colour-coding by
extracellular glucose level reveals this notable pattern (b). This plot shows
that extracellular glucose concentration g, and glucose uptake rate r, are two
independent variables. Growth rate is concisely described by the fit function
µ(r, g). P(g) is the slope of the log-linear correlation between growth rate and
uptake rate for each g; we obtain P(g) by fitting. µ_c and r_c are constants
specifying the point of convergence of the log-linear lines (µ_c = 0.44 h⁻¹,
r_c = 1.4 × 10⁷ molecules s⁻¹). Error bars, s.e.m.; n = 3. c, Full growth
landscape of budding yeast: three-dimensional plot of the function µ(r, g).
The growth trajectories followed by the parental wild-type (blue path, near
the peak of this landscape), and fluorescent Hxt1-only and Hxt6-only strains
(orange and red paths, respectively) are shown. Coloured arrows indicate the
direction the cell travels on each path as g increases. The arrows along the
two axes (along ‘[glucose]’ and ‘glucose uptake rate’) point in the direction
of increase.


including the convex-shaped growth rate of the Hxt1-only strain
(Fig. 3c, orange path), the Hxt6-only strain’s path towards growth
arrest (Fig. 3c, red path), and the wild-type’s hyperbolic growth rate
(Fig. 3c, blue path). The wild-type strain is near the peak of this
growth landscape yet its uptake rate is not much higher than that
achieved by some single-HXT strains. The growth landscape shows
that some values of (g, r) cannot sustain growth (µ = 0). Indeed, for
every g, there is a minimum uptake rate a cell needs to have for it to
have any chance of growing in that particular glucose environment
(Supplementary Fig. 9).
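
Setting the growth rate in equation (1), mu = P(g) ln(r / r_c) + mu_c, to zero gives an explicit expression for this minimum uptake rate: r_min(g) = r_c exp(-mu_c / P(g)). The sketch below evaluates it, taking mu_c and r_c as quoted in the Fig. 3 caption (exponent of r_c assumed to be 7) and using hypothetical slope values, because the fitted P(g) is not tabulated in the text.

```python
import math

MU_C = 0.44   # per hour; as quoted in the Fig. 3 caption
R_C = 1.4e7   # molecules per second; exponent assumed to be 7


def minimum_uptake_rate(slope):
    """Uptake rate at which equation (1) gives zero growth for a slope P(g)."""
    return R_C * math.exp(-MU_C / slope)


# Hypothetical slopes P(g), in per hour, for increasing glucose levels.
for slope in (0.05, 0.10, 0.20):
    r_min = minimum_uptake_rate(slope)
    print(f"P(g) = {slope:.2f} per hour -> r_min = {r_min:.3g} molecules per second")
```

Because P(g) increases with g, the minimum uptake rate needed for growth also increases with glucose level in this sketch.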


Manipulation of glucose perception by sensors 

Whereas the glucose uptake rate depends on the Hxt transporters, 
glucose perception, captured by P(g), should depend on mechanisms 
the cell uses to measure the level of extracellular glucose. Snf3 and 
Rgt2 are two glucose sensors primarily known for regulating tran- 
scription of both major and minor glucose transporter genes'*”” 
(HXTs, GAL2, STL1 and AGT1). Because such regulation is disabled 
in our single-HXT strains, we could manipulate P(g) by knocking out
these two glucose sensors without affecting the uptake rate r. We 
constructed a panel of single-HXT strains with these two sensors 
deleted (Supplementary Fig. 10). The relationship between growth 
rates and extracellular glucose concentration in these ‘sensorless’ 
strains is notably different from that in strains with the two sensors 
intact (Fig. 4a and Supplementary Fig. 11). Growth rates now generally 
increase as the glucose level increases (at constant doxycycline level). 
Moreover, without the sensors, the Hxt3-only and Hxt6-only strains
no longer approach growth arrest as the glucose level increases. Because
we deleted all minor glucose transporter genes and removed glucose’s
control over expression of the sole transporter in our single-HXT
strains, changes in uptake rate were not the reason for the growth
rescues we observed. For every combination of glucose and doxycycline
concentrations, the uptake rate of the sensorless strains was nearly
identical to that of their sensor-containing counterparts (Fig. 4b and
Supplementary Fig. 12).

In the sensorless strains, growth rate again explicitly depends on 
glucose concentration but with much reduced sensitivity (Fig. 4c, d). 
When Snf3 and Rgt2 are absent, a cell in 4% glucose acts as if it were 
in 0.06% glucose with intact sensors. Because the uptake rate remains 
virtually unchanged in the single-HXT strains when SNF3 and RGT2
are deleted, this reduced-sensitivity effect is due to a change in the 
perception function P(g), not the uptake rate r (Fig. 4d). The remain- 
ing dependence of the cell’s growth rate on the glucose concentration 
even after Snf3 and Rgt2 have been deleted suggests that other sensors 
may contribute to the effect embodied in P(g)**”’. Nonetheless, our 
experiments show that Snf3 and Rgt2 are the key determinants of 
P(g) (as quantified in Fig. 4d). 
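
One simple way to extract a slope P(g) from growth-rate and uptake-rate pairs measured at a single glucose level is a least-squares line forced through the convergence point (r_c, mu_c). The sketch below is an assumption about how such a fit could be done, not the authors' documented procedure, and it uses synthetic, noise-free data generated from a known slope.

```python
import math


def fit_slope(uptake_rates, growth_rates, mu_c, r_c):
    """Least-squares slope of (mu - mu_c) against ln(r / r_c), through the origin."""
    xs = [math.log(r / r_c) for r in uptake_rates]
    ys = [mu - mu_c for mu in growth_rates]
    return sum(x * y for x, y in zip(xs, ys)) / sum(x * x for x in xs)


MU_C, R_C = 0.44, 1.4e7   # constants as quoted in the Fig. 3 caption (exponent assumed)

# Synthetic data at one glucose level, generated with a known slope.
true_slope = 0.15                           # per hour, hypothetical
uptake = [1.0e6, 2.0e6, 4.0e6, 8.0e6]       # molecules per second
growth = [true_slope * math.log(r / R_C) + MU_C for r in uptake]

fitted = fit_slope(uptake, growth, MU_C, R_C)
print(f"fitted P(g) = {fitted:.3f} per hour")
```

Repeating such a fit for sensor-intact and sensorless data sets at each glucose level would give two P(g) curves that can be compared directly.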

The behaviour depicted by equation (1) should apply to the wild-type
strain as well, as long as it achieves an uptake rate within the
range probed with the single-HXT strains used to construct our
growth landscape. We measured the wild-type’s uptake rate and
found that it was below the critical uptake rate r_c for glucose
concentrations smaller than 0.02% (Fig. 3b and Supplementary Fig. 16).
For higher [glucose], the uptake rate exceeds r_c. When the wild-type
cell’s uptake rate is below r_c, its growth rate fits with the trend shown
in Fig. 3b. For higher glucose concentration, the effect of perception
on the wild-type’s growth rate disappears (Fig. 3b). One possible
explanation is that as long as the glucose concentration is not too
low, the wild-type escapes the seemingly detrimental effect of
perception on growth rate by making enough hexose transporters to go
beyond r_c. But at lower glucose levels, at which its uptake is less than
r_c, it properly tunes the interaction between glucose perception and
uptake (quantified by the product P(g)ln(r/r_c)) such that its growth
rate will increase when the cell perceives more extracellular glucose.
Such tuning suggests that the transcriptional regulation of the HXT
genes by Snf3 and Rgt2 is organized so that the wild-type always
climbs uphill in the growth landscape (Fig. 3c) as it perceives an
increase in the extracellular glucose concentration.
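
One way to make ‘climbing uphill’ precise is to differentiate equation (1), µ = P(g) ln(r/r_c) + µ_c, with respect to g along the cell’s own trajectory r(g). This is a sketch that assumes P(g) and r(g) are differentiable and that P(g) > 0; it is not a derivation taken from the paper.

```latex
\frac{d\mu}{dg}
  = \frac{dP}{dg}\,\ln\!\left(\frac{r}{r_c}\right)
  + \frac{P(g)}{r}\,\frac{dr}{dg},
\qquad
\frac{d\mu}{dg} > 0
\;\Longleftrightarrow\;
\frac{d\ln r}{dg} > \frac{d\ln P}{dg}\,\ln\!\left(\frac{r_c}{r}\right)
```

For r < r_c the first term is negative when P rises with g, and the second is positive when uptake rises with g. Growth rate therefore increases only if the relative gain in uptake rate outpaces the relative gain in perception, weighted by ln(r_c/r).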

The critical point (r_c, µ_c) may represent a region of phase transition
in the cell’s growth and metabolism. The cell markedly increases its
ethanol production rate as its uptake rate increases above the critical
rate r_c (Supplementary Fig. 17). This suggests that when its uptake
rate is below r_c, the cell metabolizes glucose mainly by respiration, but
then switches to a largely fermentative metabolism as the uptake rate
exceeds r_c. A key rate-limiting step in fermentation is the import of
glucose, and therefore the cell only redirects its glucose flux from 
respiration to fermentation when its glucose uptake rate is suffi- 
ciently high*®*’. Our results indicate that this major redistribution 
of flux occurs around r_c.


Discussion 


Glucose perception and import are two separable modules that each 
affect the growth rate, but it is the interaction between them that ulti- 
mately determines the cell’s growth rate, and that interaction can be 
both precisely altered and measured. The question remains as to why it 
would make sense for yeast to grow according to equation (1), which 
allows for possibly detrimental growth if the interaction between the
perception and import modules is not properly tuned. One explanation 
may be that yeast has no way to directly ‘measure’ its glucose import rate 
in real-time. Indeed, there is no known ‘flux sensor’ that yeast uses to 
measure its glucose import rate in real-time and then adjust the pro- 
duction level of Hxt transporters to change the glucose import rate if it 
senses that the flux is too low. In fact, Hxt expression levels are primarily 
set by the extracellular glucose concentration** (Supplementary Fig. 1).

Figure 4 | Manipulation of the cell’s perception of extracellular glucose,
leaving uptake rate unperturbed, can yield significant growth-rate
changes. a, Growth rates of single-HXT strains lacking two glucose sensors
(snf3Δ rgt2Δ, bold lines) along with their counterparts with intact sensors
(dotted lines) are shown for [doxycycline] = 5 µg ml⁻¹. Error bars, s.e.m.;
n = 3. b, Knocking out the two glucose sensors leaves the cell’s glucose
uptake rate virtually unperturbed. Just the Hxt1-only and Hxt2-only strains
are shown here for simplicity (see Supplementary Figs 7 and 12 for others).
Each data point represents a particular combination of glucose and
doxycycline concentrations. Error bars, s.e.m.; n = 3. c, By yEGFP fusion,


Although yeast certainly can measure the extracellular glucose level 
directly and the intracellular glucose level indirectly (for example, 
through the catabolite-repressor Mig1 that uses intracellular glucose 
as its substrate)***°, knowing the two glucose levels is not sufficient 
for yeast to infer what its glucose import rate is. This is because a given 
steady-state glucose concentration gradient can be maintained by a 
combination of wide ranges of glucose import rate and intracellular 
glucose breakdown rate. Because the cell has no direct way to measure 
the breakdown rate (there is no known ‘rate sensor’ measuring intra- 
cellular glucose breakdown), the cell cannot infer what the glucose 
import rate is in real-time just from the difference between extracellular 
and intracellular glucose. Given the engineering difficulty of building 
flux sensors, yeast may have solved the problem by evolving glucose 
sensors such as Snf3 and Rgt2 to measure the extracellular glucose level, 
then anticipate a certain glucose import rate would be achieved, set up 
intracellular activities to process glucose being imported at the antici- 
pated rate, and make sure that such an import rate is indeed achieved by 
putting its HXT genes under the control of those glucose sensors 
(Supplementary Fig. 1). 

Continuing efforts at large-scale modelling of glucose metabolism, 
gene regulation’® and cellular signalling must decouple and consider 
how the cell’s response varies when glucose uptake rate is varied inde- 
pendently of extracellular glucose level. For instance, microarray studies 
have shown that hundreds of genes involved in ribosomal biogenesis, 
which are energetically very costly, are upregulated several fold as yeast 
are subjected to ever increasing levels of glucose’’. In these studies, as the
level of glucose increases, so does the glucose import rate. These
observed large-scale changes are thus due to the conflated effects of 
glucose perception and import. It would be interesting to measure

fluorescent sensorless single-HXT strains were constructed for comparison
with their sensor-intact counterparts. The features of growth rates in a were
preserved after this fusion (Supplementary Fig. 11). Growth rates and
glucose uptake rates of these strains were measured (Supplementary Figs
12-15). For comparison, data for the sensor-intact single-HXT strains (from
Fig. 3a) are shown in grey (µ_c = 0.40 h⁻¹, r_c = 1.4 × 10⁷ molecules s⁻¹).
Error bars, s.e.m.; n = 3. d, The sensitivity function P(g), calculated from
fitting the data in c and Fig. 3a, is shown for strains with intact sensors (black)
and snf3Δ rgt2Δ strains (red). Error bars indicate 95% confidence interval in
these fits.


which of these changes are due to glucose perception and import sepa- 
rately by decoupling the two effects. We hope that our model, as well as 
the framework used to extract some key principles from the complexity 
underlying yeast growth, will assist continuing efforts to rationally 
engineer**“° and understand microbial metabolism at the systems 
level*!*. 


METHODS SUMMARY 

Growth rate measurements. Growth rates were measured while the cells were in 
log-phase growth in 5 ml batch cultures at 30 °C using synthetic media supple- 
mented with the desired doxycycline and glucose concentrations. These concen- 
trations remained nearly constant during growth (Supplementary Information). 
Using a spectrophotometer (Hitachi U-1800), we measured the absorbance at
600 nm (A_600nm) of these batch cultures over time, and extracted the growth rate
of the cells.
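
Extracting a log-phase growth rate from absorbance readings amounts to fitting a straight line to the logarithm of the absorbance against time. The sketch below shows one way to do this with ordinary least squares; the fitting method is an assumption (the text does not state how the rate was extracted) and the readings are synthetic.

```python
import math


def growth_rate_from_absorbance(times_h, a600):
    """Slope of ln(A600) against time (per hour), by ordinary least squares."""
    logs = [math.log(a) for a in a600]
    n = len(times_h)
    t_mean = sum(times_h) / n
    y_mean = sum(logs) / n
    num = sum((t - t_mean) * (y - y_mean) for t, y in zip(times_h, logs))
    den = sum((t - t_mean) ** 2 for t in times_h)
    return num / den


# Synthetic readings for a culture growing exponentially at 0.35 per hour.
times = [0.0, 1.0, 2.0, 3.0, 4.0]                       # hours
readings = [0.05 * math.exp(0.35 * t) for t in times]   # A600 values

mu = growth_rate_from_absorbance(times, readings)
doubling_time = math.log(2) / mu
print(f"growth rate = {mu:.3f} per hour, doubling time = {doubling_time:.2f} h")
```

Only readings taken while the culture is in log phase should enter such a fit, since the linear relation between ln(A600) and time holds only there.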

Glucose uptake rate measurements and calculations. Glucose uptake rates 
were determined by measuring the rate of glucose depletion in the growth 
medium while the cells were in log-phase growth. It can be shown
(Supplementary Information) that the glucose uptake rate per population
density of cells (in units of mM h⁻¹ A_600nm⁻¹) is approximately

\[ r(G_0) \approx \frac{\mu \, (G_0 - G(t))}{\rho(t) - \rho_0} \]

in which ρ(t) − ρ_0 is the measured change in A_600nm of the cell culture after time t,
µ is the log-phase growth rate, and G_0 − G(t) is the depleted glucose
concentration in the growth medium after time t. This depleted glucose concentration was
measured using a standard commercial glucose assay kit (Sigma) that is based on
the conversion of glucose by hexokinase and NADP⁺-dependent
glucose-6-phosphate-dehydrogenase. We compared the measured glucose uptake rates
with the uptake rates calculated using an independent method for the fluorescent
single-HXT and wild-type strains. We calculated the glucose uptake rates by
using the known Michaelis-Menten parameters (V_max and K_m) of Hxt
transporters’° and the relative number of Hxt proteins per cell determined by measuring
the average single-cell yEGFP fluorescence (Supplementary Information). These
comparisons showed a close agreement between our measured and calculated
uptake rates (Supplementary Figs 6 and 14).
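
The measured uptake rate follows from three quantities: the growth rate, the drop in glucose concentration and the gain in absorbance, combined as r(G0) = mu (G0 - G(t)) / (rho(t) - rho0). The sketch below applies that relation and then converts the result to molecules per second per cell. The cell-density factor of 1.7 × 10^7 cells per ml per absorbance unit is how the conversion factor in the full Methods is read here (its exponent is an assumption), and the example measurements are made up.

```python
AVOGADRO = 6.02214076e23        # molecules per mole
CELLS_PER_ML_PER_A600 = 1.7e7   # conversion factor; exponent assumed to be 7


def uptake_per_absorbance(mu, glucose_drop_mM, absorbance_gain):
    """r(G0) = mu * (G0 - G(t)) / (rho(t) - rho0), in mM per hour per A600 unit."""
    return mu * glucose_drop_mM / absorbance_gain


def to_molecules_per_second_per_cell(rate_mM_per_h_per_a600):
    """Convert mM per hour per A600 unit into molecules per second per cell."""
    moles_per_ml_per_h = rate_mM_per_h_per_a600 * 1.0e-6   # 1 mM = 1e-6 mol per ml
    molecules_per_ml_per_s = moles_per_ml_per_h * AVOGADRO / 3600.0
    return molecules_per_ml_per_s / CELLS_PER_ML_PER_A600


# Made-up example: growth rate 0.4 per hour, 0.5 mM glucose consumed,
# absorbance increased by 0.2 units over the same interval.
r = uptake_per_absorbance(mu=0.4, glucose_drop_mM=0.5, absorbance_gain=0.2)
per_cell = to_molecules_per_second_per_cell(r)
print(f"r = {r:.2f} mM per hour per A600 -> {per_cell:.3g} molecules per second per cell")
```

With this conversion, an uptake rate of 1 mM per hour per absorbance unit corresponds to roughly 10^7 molecules per second per cell.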


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature.

METHODS

Strain background and construction. A list of strains with diagrams summarizing 
their key features is provided in the Supplementary Information. All strains were 
derived from the haploid strain CEN.PK2-1C (MATa, gift from E. Boles)”, referred
to as the ‘wild-type’ in our study. Both EBY.VW4000 and EBY.VW5000 are deficient
in hexose transport owing to deletions of all HXT genes as well as genes encoding
transporters with minor glucose uptake capabilities (agt1Δ ydl247wΔ yjr160Δ)”*.
HY4D1 and HY5F1 each contain reverse tetracycline-controlled transactiva- 
tor (rtTA) protein expressed constitutively by the MYO2 promoter (inserted 
into EBY.VW4000 and EBY.VW5000 respectively using plasmid pDH18 
(EUROSCARF) containing HIS5 gene) and CFP constitutively expressed by 
Prep. XhoI-P_TETO7-BamHI and BamHI-HXTn-NotI fragments were cloned into a
pRS305 (EUROSCARF) backbone containing the LEU2 gene (n = 1-4, 6).
The single-HXT strains were constructed by linearizing these plasmids with NarI
and integrating them into the defective LEU2 locus (leu2-3) in HY4D1. To
construct fluorescent single-HXT strains, the yEGFP-T_ADH1-Kan fragment was
amplified from the pKT127 plasmid (EUROSCARF) and fused to the carboxy
terminus of the HXTn open-reading frame (ORF) in each of the single-HXT strains
by standard PCR integration”. This fragment was also fused to C terminus of HXTn 
ORF (n= 1-4, 6, 7) in CEN.PK2-1C, thus resulting in six fluorescent wild-type 
strains (Supplementary Fig. 16). The ‘sensorless’ versions of single-HXT strains 
(snf3Δ rgt2Δ) were constructed in the same way as their sensor-intact counterparts
mentioned earlier, by using HY5F1 instead of HY4D1. To probe the wild-type’s 
transcriptional regulation of each of the HXT genes (Supplementary Fig. 1),
XhoI-P_HXTn-BamHI and BamHI-YFP-NotI fragments were cloned into a pRS305 backbone
containing the LEU2 gene (n = 1-4, 7) and integrated into the defective LEU2 locus
(leu2-3) in CEN.PK2-1C by linearizing the plasmid with either NarI (for n = 1) or
ClaI (for all other n), resulting in five strains. The P_HXT1, P_HXT2, P_HXT3, P_HXT4 and
P_HXT7 promoter sequences refer to the 1,941, 850, 1,996, 1,544 and 2,042 base pairs
upstream of the start codon of the respective genes. These sequences include all the
known binding sites of transcription factors for the respective genes”. 

Growth rate measurements. All growth rates reported in our study were measured
while the cells were in log-phase growth in 5 ml batch cultures at 30 °C, in a
standard synthetic medium with various combinations of glucose and doxycycline
concentrations. To bring the cells into log-phase, the single-HXT strains were
first grown in a standard synthetic medium containing 2% maltose and the desired
concentration of doxycycline until the cells had been in log-phase for roughly
12 h. This procedure ensured that the cells were already making Hxt proteins
needed to initiate glucose uptake immediately after being transferred to glucose 
media. Then these cells were diluted into the standard synthetic medium with the
same amount of doxycycline, but this time containing glucose instead of maltose.
These dilutions were done such that by the time the density of cells in the batch
culture reached a level detectable by our spectrophotometer (Hitachi U-1800)
(roughly 15 h after dilution), the cells had adjusted to the glucose medium and were
in log-phase growth. Hence, the transient growth rate change associated with
the transfer from maltose to glucose media did not enter into our growth rate
measurements. In a separate experiment, we confirmed this was indeed the case by
further diluting these cultures into an identical glucose medium, which showed
that pre-growing the cells in maltose did not affect the growth rates
reported in our study. By measuring the A_600nm of these batch cultures over time,
we extracted the growth rate of the cells. Strains that approached growth arrest
also went through the same procedure as above. After transfer from maltose to
glucose media, these cells’ growth rates transiently decreased to nearly zero
during a period of roughly 24 h. Under the microscope,
no abnormal cell morphologies were detected, thus indicating normal growth
(that is, no pseudohyphal or filamentous growth was detected).

Fluorescence measurements. The average single-cell fluorescence due to yEGFP 
fused to the C terminus of HXT genes in the wild-type and single-HXT strains 
was measured using a Becton Dickinson FACScan flow cytometer with excitation 
laser at 488 nm. Emission filter FL1 (530/30) was used to detect the yEGFP
fluorescence levels as well as the YFP for determining the P_TETO7 induction curves
in the calibration strains HY4DCal5 and HY5FCal2. Before observation using
FACScan, the strains were grown using the protocol outlined in the ‘growth rate
measurements’ section. The mean fluorescence values reported in our study
represent the steady-state levels of Hxt proteins in single cells, as no appreciable
changes in fluorescence were detected while the cells were growing in log-phase.

Glucose uptake rate measurements and calculations. Glucose uptake rates of
cells were determined by measuring the rate of glucose depletion in the growth 
medium while the cells were in log-phase growth. The reasoning behind this
procedure is as follows. If the cell’s growth rate at glucose concentration G_0 is µ,
G(t) is the concentration of glucose in the growth medium at time t, r(G(t)) is the
uptake rate per absorbance of the cells as a function of extracellular glucose, and
ρ_0 is the absorbance of cells at t = 0, then the decrease in glucose concentration in
the growth medium over time t is

\[ G_0 - G(t) = \int_0^t r(G(t')) \, \rho_0 \exp(\mu t') \, dt' \qquad (2) \]

If this change in glucose concentration is sufficiently small, but large enough
to be detectable by our chemical assay (described below), then we can approximate
r(G(t)) ≈ r(G_0) and µ as a constant during the time interval t. Then
equation (2) can be solved for r(G_0):

\[ r(G_0) = \frac{\mu \, (G_0 - G(t))}{\rho(t) - \rho_0} \]

in which r(G_0) is the uptake rate per A_600nm measured in units of
mM h⁻¹ A_600nm⁻¹. This was then converted into molecules s⁻¹ cell⁻¹ using the
conversion factor 1.7 × 10⁷ cells ml⁻¹ A_600nm⁻¹. ρ(t) − ρ_0 is the change in A_600nm of
the cells measured using the spectrophotometer (Hitachi U-1800), and µ is the
growth rate determined by the method mentioned previously. The change in
glucose concentration G_0 − G(t) was measured using the standard commercial
glucose assay kit (Sigma) based on conversion of glucose through hexokinase and
NADP⁺-dependent glucose-6-phosphate-dehydrogenase. We compared the
measured glucose uptake rates with the uptake rates calculated using an
independent method for the fluorescent single-HXT and wild-type strains. We
calculated the glucose uptake rates by using the known Michaelis-Menten parameters
(V_max and K_m) of Hxts”® and the relative number of Hxt proteins per cell inferred
from measuring the average single-cell yEGFP fluorescence (Supplementary
Information). These comparisons showed a close agreement between our
measured and calculated uptake rates (Supplementary Figs 6 and 14).

Transport mechanism of a bacterial
homologue of glutamate transporters


Nicolas Reyes’, Christopher Ginter’ & Olga Boudker’ 


Glutamate transporters are integral membrane proteins that catalyse a thermodynamically uphill uptake of the
neurotransmitter glutamate from the synaptic cleft into the cytoplasm of glia and neuronal cells by harnessing the energy of
pre-existing electrochemical gradients of ions. Crucial to the reaction is the conformational transition of the transporters
between outward and inward facing states, in which the substrate binding sites are accessible from the extracellular space
and the cytoplasm, respectively. Here we describe the crystal structure of a double cysteine mutant of a glutamate
transporter homologue from Pyrococcus horikoshii, GltPh, which is trapped in the inward facing state by cysteine crosslinking.
Together with the previously determined crystal structures of GltPh in the outward facing state, the structure of the
crosslinked mutant allows us to propose a molecular mechanism by which GltPh and, by analogy, mammalian glutamate
transporters mediate sodium-coupled substrate uptake.


Glutamate is the predominant excitatory neurotransmitter in the 
brain responsible for learning, memory formation and higher cog- 
nitive function. Specialized membrane proteins—glutamate trans- 
porters (also termed excitatory amino acid transporters (EAATs)) 
—mediate the transmitter uptake from the extracellular space into the 
cytoplasm of astrocytes and neurons against steep concentration 
gradients to allow for rounds of neurotransmission and to prevent 
glutamate-mediated excitotoxicity’. EAATs are members of a ubiqui- 
tous solute carrier 1 (SLC1) family of secondary solute transporters’, 
which catalyse concentrative uptake of the acidic and neutral amino 
acids and dicarboxylic acids in a reaction coupled to symport of 
sodium and/or protons and, in the case of EAATs, antiport of pota- 
ssium. The conceptual mechanisms of protein-mediated solute trans- 
port were developed over 40 years ago’ and focus on the ability of 
transporters to undergo isomerization between two states: an outward 
facing state, in which the substrate-binding site is accessible from the 
external solution, and an inward facing state, in which it is reached 
from the cytoplasm. Although our understanding of the molecular
mechanisms of transport has advanced significantly in the last
decade®’, the structural transitions between the outward and inward 
facing states remain poorly characterized. 

The sodium/aspartate symporter from Pyrococcus horikoshii
(GltPh), an archaeal homologue of the EAATs, was one of the first
sodium-coupled transporters for which a structure was determined
at a near-atomic resolution*’. The structural analysis of
GltPh revealed that individual protomers assemble into a homotrimer,
forming a deep solvent-filled bowl open to the extracellular
solution and reaching approximately half way across the lipid bilayer
(Fig. 1a). The first six transmembrane (TM) segments of each GltPh
protomer mediate all inter-subunit contacts and form a distorted 
cylinder within which the highly conserved carboxy terminus of 
the protein is folded into a compact core. The core consists of an 
intracellular re-entrant helical hairpin (HP) 1, TM7 with an 
unwound segment, an extracellular hairpin, HP2, and an amphi- 
pathic TM8. Despite the trimeric assembly, shared by all character- 
ized glutamate transporter homologues'’”’, protomers function 
independently’*’. Consistently, in the GltPh crystal structure, an
L-aspartate (L-Asp) molecule and two sodium ions are bound within 


the core of each individual protomer. Positioned between the tips of 
HP1 and HP2 at the bottom of the extracellular bowl, the substrate is 
occluded from the aqueous environment only by HP2. Notably, in a 
crystal structure of GltPh in complex with a competitive blocker,
L-threo-β-benzyloxyaspartate (L-TBOA)”"*, HP2 assumes an open
conformation, revealing its dynamic nature and suggesting that it
may serve as an extracellular gate. Bound L-Asp and L-TBOA are
separated from the cytoplasm by over 15 Å of a compact protein core,
indicating that these structures correspond to the outward facing 
state of the transporter. However, the structure of the inward facing 
state and the manner in which the substrate and sodium ions are 
released into the intracellular solution remain unknown. 


Crosslinking cysteines in TM2 and HP2 

In a study predating the crystal structure determination of GltPh, it
was reported that two cysteines placed in TM2 and HP2 of EAAT1
formed a spontaneous disulphide bond, suggesting close proximity
of these residues'’. Notably, in the crystal structures of L-Asp- and
L-TBOA-bound GltPh the corresponding residues K55 in TM2 and
A364 in HP2 are over 25 Å apart and cysteines at these positions are
unlikely to become crosslinked. Moreover, the inter-subunit
residue-to-residue distances are also 25-30 Å (Fig. 1a). Intrigued by these
results, we reasoned that TM2 and HP2 become juxtaposed in an
as yet structurally uncharacterized functional state of the transporter
and set out to reproduce the double cysteine mutation in GltPh for
crystallographic studies. The K55C/A364C mutant was generated
within a cysteine-less GltPh, expressed in Escherichia coli and purified
in detergent solution. Oxidative crosslinking of GltPh(K55C/A364C)
in the presence of copper 1,10-phenanthroline (CuPhen) yielded
distinct protein species with a higher electrophoretic mobility
(Fig. 1b), which we identified as intra-molecularly crosslinked
GltPh protomers (Supplementary Fig. 1a-c). Interestingly,
CuPhen-catalysed disulphide bond formation is greatly facilitated when the
transporter is purified in the absence of sodium ions (Na⁺) and
L-Asp, suggesting that the state in which residues 55 and 364 are
proximal is more populated under these conditions (Fig. 1b and
Supplementary Fig. 2a). Qualitatively similar results were also
obtained using unpurified K55C/A364C mutant within E. coli crude 
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Figure 1| Crosslinking of Gltp,(K55C/A364C). a, Cartoon representation 
of two substrate-bound Gltp,, protomers (Protein Data Bank accession 
2NWX) viewed in the membrane plain. The third protomer and TM4 of the 
protomer on the right are removed for clarity. TM1—-TM6 are coloured 
salmon and wheat; the C-terminal cores are light blue; and the hairpins HP1 
and HP? are dark blue. Bound L-Asp and Na” are shownas sticks and purple 
spheres, respectively. Orange spheres correspond to the Cx atoms of 
residues 55 and 364. All molecular representations have been generated 
using Pymol**. b, SDS-PAGE analysis of Gltp),(K55C/A364C) before and 
after incubation with 100 |1M CuPhen for indicated periods of time. 
Detergent-solubilized purified Gltp,(K55C/A364C) (left) and unpurified 
transporter in crude E. coli membranes (right) were visualized by Coomassie 
staining and western blotting, respectively. c, Crosslinking of Gltp,(K55C/ 
A364C) in the presence of 50 uM HgCl,. Samples were analysed as in b. 


membranes, demonstrating that GltPh, like EAAT1, adopts the new
conformation in the context of a lipid bilayer. Moreover, cysteines at
structurally adjacent positions did not form a disulphide bond, suggest-
ing that the distance shortening between positions 55 and 364 occurs in
a highly specific manner (ref. 19 and Supplementary Fig. 2b). Notably,
incubation of GltPh(K55C/A364C) with divalent mercury (Hg2+),
which serves as a bi-functional thiol-specific crosslinker, yields essen-
tially complete crosslinking both in the presence and in the absence of
Na+ and L-Asp in detergent and in crude membranes (Fig. 1c).
Furthermore, crosslinking is completed within seconds, suggesting that
both substrate-free and substrate-bound GltPh rapidly sample the
crosslinking-competent state.


Crystal structure of the crosslinked GltPh(K55C/A364C)


Hg2+-crosslinked K55C/A364C mutant (GltPh(55C/364CHg)) in the
presence of Na+ and L-Asp yielded crystals diffracting to 3.5-3.9 Å
(see Supplementary Table 1). A previous extensive study suggested
that the functionally relevant conformational transitions of a related
bacterial glutamate transporter do not involve regions engaged in
protein trimerization, because crosslinking of multiple interfacial
residues had no effect on substrate transport. Hence, we used a
trimeric GltPh model composed of TM2, TM4 and TM5, which are
responsible for all inter-subunit contacts (Fig. 1a), to search for a
molecular replacement solution. The initial phases yielded distinct
peaks within the anomalous difference Fourier map corresponding
to Hg atoms at positions adjacent to cysteine 55 in TM2. Rounds of
manual building and refinement resulted in a protein model lacking 5
and 9 residues on the amino and carboxy termini, respectively, and
including 98% of the side chains. To verify the model, we crystallized
the selenomethionine-substituted GltPh(55C/364CHg). The Se peaks
within the anomalous difference Fourier maps are in perfect agree-
ment with the location of 17 methionines in the protein model
(Supplementary Fig. 3a). The distance between the Cα atoms of
modelled cysteines 55 and 364 is 7.4 Å, and the Hg peak within the
anomalous difference Fourier map is positioned between these resi-
dues with the sulphur to mercury distances estimated at 2-2.3 Å
(Supplementary Fig. 3b), similar to those observed in small molecu-
lar mass compounds and peptides.


GltPh(55C/364CHg) is captured in the inward facing state


The most striking structural feature of GltPh(55C/364CHg) compared
to the wild-type GltPh is an approximately 18 Å movement of the sub-
strate binding sites from near the extracellular solution at the bottom of
the extracellular bowl to near the cytoplasm (Fig. 2a, b). Structural
superposition using transmembrane segments TM1, TM2, TM4 and
TM5 from all three protomers reveals that these segments remain
largely unchanged, with the root mean squared deviation (r.m.s.d.)
for the atomic positions of 1.2 Å. The remainder of the protein, includ-
ing TM3, TM6, HP1, TM7, HP2 and TM8, does not align and has
undergone a substantial movement (Fig. 2a). Remarkably, when these
regions alone are superposed, their structures are essentially identical,
yielding r.m.s.d. of 0.6 Å. These results suggest that GltPh can be par-
titioned into two structural domains, which we termed the trimeriza-
tion and transport domains (Fig. 2c). The rigid body movements of the
transport domains relative to the trimerization domain dominate the
structural changes between the substrate-bound wild-type GltPh and
GltPh(55C/364CHg) (Fig. 2a, b and Supplementary Movie). The struc-
tural environment of the ion- and substrate-binding sites, which are
positioned entirely within the transport domains, is preserved. The
Fo − Fc difference Fourier map reveals excess electron density at these
sites (Supplementary Fig. 4), suggesting that the transporter remains
bound to L-Asp and to two Na+ ions. Consistently, substrate-free
GltPh(55C/364CHg) in solution binds L-Asp with high affinity and in
a sodium-dependent manner (Fig. 2d). In the structure of GltPh(55C/
364CHg) the substrate binding sites lie near the bottom of deep crevices
formed within each protomer between HP1 and the cytoplasmic
portion of TM7 of the transport domain and TM2 and TM5 of the
trimerization domain (Fig. 2b). Bound L-Asp and Na1 are completely
occluded from the solution by the tips of HP1 and HP2, whereas Na2 is
partially exposed to the solvent (Fig. 2a, b). Because substrate and ion
binding sites are near the intracellular solution, we propose that the
conformation of GltPh(55C/364CHg) corresponds to an inward facing
state of the transporter.
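
The domain partition described above rests on comparing r.m.s.d. values for different subsets of atoms. The following is a minimal sketch of the r.m.s.d. calculation only, assuming the two coordinate sets are already atom-matched and superposed on the chosen reference segments. The toy coordinates and the pure 18 Å shift along z are illustrative assumptions, not values taken from the deposited structures, and the real movement also includes a rotation.

```python
import math


def rmsd(coords_a, coords_b):
    """Root mean squared deviation between two matched coordinate lists.

    Assumes both lists are in the same frame (already superposed).
    """
    if len(coords_a) != len(coords_b) or not coords_a:
        raise ValueError("coordinate lists must be non-empty and of equal length")
    total = 0.0
    for (xa, ya, za), (xb, yb, zb) in zip(coords_a, coords_b):
        total += (xa - xb) ** 2 + (ya - yb) ** 2 + (za - zb) ** 2
    return math.sqrt(total / len(coords_a))


# Hypothetical toy atoms (in angstroms), not real GltPh coordinates.
rigid_domain = [(0.0, 0.0, 0.0), (3.8, 0.0, 0.0), (3.8, 3.8, 0.0)]
mobile_domain = [(10.0, 0.0, 5.0), (13.8, 0.0, 5.0), (13.8, 3.8, 5.0)]

# Assumed rigid-body shift of the mobile domain along z.
shift = 18.0
mobile_moved = [(x, y, z - shift) for x, y, z in mobile_domain]

# In the frame of the rigid domain, the rigid atoms coincide...
rigid_rmsd = rmsd(rigid_domain, rigid_domain)
# ...while the mobile atoms deviate by the size of the shift.
mobile_rmsd = rmsd(mobile_domain, mobile_moved)
# Superposing the mobile domain on itself (undoing the shift) removes the deviation.
mobile_refit = rmsd(mobile_domain, [(x, y, z + shift) for x, y, z in mobile_moved])

print(rigid_rmsd, mobile_rmsd, mobile_refit)
```

A low r.m.s.d. within each subset combined with a large deviation between subsets is the signature of rigid-body domain movement that the text describes.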

Because the trimerization domain undergoes little conformational
transition, it is unlikely to change its position relative to the mem-
brane plane. Hence, we estimate that a ~25 Å thick apolar region of
the lipid bilayer, which in the wild-type GltPh aligns approximately
with the hydrophobic portion of TM1, is similarly positioned in
GltPh(55C/364CHg). Such bilayer placing suggests that several polar
residues at the extracellular ends of the lipid-facing TM3 and TM6
move into the hydrophobic region of the membrane. It is possible
that the lack of the lipid bilayer in the crystals removes important
constraints, resulting in a non-native positioning of the transport
domain. However, crosslinking of residues 55C and 364C in GltPh

Figure 2 | GltPh(55C/364CHg) in the inward facing substrate-bound state.
a, b, Cartoon representation of the single protomers (a) and surface
representation of the trimers sliced through the binding sites (b). Wild-type
(WT) GltPh and GltPh(55C/364CHg) are shown on the left and right,
respectively. In a, TM1, TM2, TM4 and TM5 are coloured wheat, with the
remainder of the protomer light blue. Na+ ions are shown as purple spheres.
c, Extracellular view of GltPh(55C/364CHg) with straight and curved lines
delineating individual protomers and transport domains, respectively. The
trimerization domains (wheat) and transport domains (blue) are connected
by short cytoplasmic (filled arrows) and extracellular (open arrows) loops,


and corresponding residues in EAAT1 occurs efficiently in lipid
membranes, suggesting that a sufficient range of motions of the
transport domain is allowed. Movement of the transport domain
towards the cytoplasm also significantly increases its exposure to
the intracellular solvent. Notably, although the total exposed cyto-
plasmic surface area increases from ~1,300 Å² in the wild type to
~4,200 Å² in GltPh(55C/364CHg), the fraction of the apolar area
changes only modestly from 57% to 63%. Both values are well within
the range reported for the solvent-exposed area of soluble proteins.
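
As a worked illustration of these figures, the sketch below converts the quoted totals and apolar fractions into absolute apolar and polar areas. The input numbers are the approximate values from the text; the function name is hypothetical.

```python
def split_exposed_area(total_area, apolar_fraction):
    """Return (apolar, polar) exposed area in square angstroms."""
    apolar = total_area * apolar_fraction
    return apolar, total_area - apolar


# Approximate values quoted in the text (areas in square angstroms).
wt_apolar, wt_polar = split_exposed_area(1300.0, 0.57)
mut_apolar, mut_polar = split_exposed_area(4200.0, 0.63)

print(f"wild type: apolar {wt_apolar:.0f}, polar {wt_polar:.0f}")
print(f"crosslinked mutant: apolar {mut_apolar:.0f}, polar {mut_polar:.0f}")
```

With these inputs the apolar area is about 741 Å² in the wild type and about 2,646 Å² in the crosslinked mutant, so the absolute apolar exposure grows more than threefold even though the apolar fraction barely changes.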


Domain interaction interfaces 


Both wild-type GltPh and GltPh(55C/364CHg) bury approximately
equal surface areas of ~2,500 Å² on the interface between the trimeri-
zation and transport domains. Furthermore, in both structures
essentially the same residues of the trimerization domain are
involved, forming a smooth relatively featureless interaction surface,
which is over 80% hydrophobic (Fig. 3a, b). Although TM1 and TM4
contribute to the contact area, TM2 and TM5, lying parallel to each
other and crossing the membrane at an oblique angle, provide the
bulk of the interactions, contributing 77% and 67% to the interaction
surface in the wild type and GltPh(55C/364CHg), respectively. In con-
trast, the transport domains in the two structures present interaction
interfaces with little overlap, but similar in area (1,250 Å²) and
hydrophobicity (70%). They are dominated by HP1 and the cyto-
plasmic portion of TM7 in the wild-type GltPh (Fig. 3a), and by HP2

highlighted in red. The long TM3-TM4 loops (arrowhead) cross over the
transport domains. d, Isothermal titration calorimetry analysis of L-Asp
binding to GltPh(55C/364CHg). Shown are the binding heats in 10 mM NaCl.
The linear dependence (slope = −2.6 ± 0.7) of the log of the apparent
dissociation constant, Kd, on the log of Na+ concentration is shown in the
inset. GltPh(55C/364CHg) was exchanged into Na+/L-Asp-free buffer, diluted
to 15-20 µM in the reaction cell of the Microcal ITC, supplemented with
indicated concentrations of Na+, and titrated with L-Asp at 25 °C. The
binding enthalpy and the apparent number of binding sites were 23.6 ± 0.8
kcal mol⁻¹ and 0.4 ± 0.03 (n = 6), respectively.
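
The slope quoted in this caption comes from a straight-line fit of log Kd against log Na+ concentration. A minimal sketch of such a fit is given below, using ordinary least squares. The data points are synthetic, generated on the assumption of an exact power law with exponent −2.6; they are not the measured values, and the source does not state which fitting procedure was used.

```python
import math


def loglog_slope(concentrations, kd_values):
    """Least-squares slope of log10(Kd) versus log10(concentration)."""
    xs = [math.log10(c) for c in concentrations]
    ys = [math.log10(k) for k in kd_values]
    n = len(xs)
    mean_x = sum(xs) / n
    mean_y = sum(ys) / n
    sxx = sum((x - mean_x) ** 2 for x in xs)
    sxy = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    return sxy / sxx


# Synthetic, hypothetical data: Kd proportional to [Na+]^-2.6.
na_mM = [1.0, 3.0, 10.0, 30.0]
kd_nM = [5000.0 * c ** -2.6 for c in na_mM]

slope = loglog_slope(na_mM, kd_nM)
print(round(slope, 2))
```

A slope of this magnitude means that the apparent affinity for L-Asp changes steeply with sodium concentration, which is the property the transport model relies on.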


and the extracellular portion of TM8 in GltPh(55C/364CHg) (Fig. 3b),
which provide 92% and 88% of the total buried area, respectively.
The analysis of the evolutionary conservation reveals that the inter-
acting residues within the trimerization domain as well as within the
two alternative interfaces of the transport domain are all highly con-
served (Fig. 3c, d). The high conservation of HP2 surface residues and
the reported transport inhibition of the mammalian EAATs upon
chemical modification of cysteines in HP2 (refs 24-28) are consistent
with this region being involved in specific protein contacts as
observed in GltPh(55C/364CHg). The disengagement of HP1 and
the N-terminal part of TM7 and their transition to the cytoplasmic
surface of the transporter is also consistent with the previously
reported intracellular solvent accessibility of these regions in related
prokaryotic and mammalian transporters.

It has been noted previously that the C-terminal core of GltPh
contains structurally symmetrical elements. Specifically, HP1 and
the N-terminal half of TM7 can be superimposed on HP2 and the
N-terminal portion of TM8 with r.m.s.d. of 2.5 Å. It is precisely these
triplets of helices that constitute the two alternative interaction inter-
faces of the transport domain (Fig. 3a, b). Notably, the
N-terminal portion of the transporter, composed of the first six
transmembrane segments, also exhibits a pseudo-two-fold structural
symmetry despite the lack of detectable sequence conservation.
TM4-TM6 can be superimposed upon TM1-TM3 with r.m.s.d. of
~6 and 4.3 Å for the wild type and mutant, respectively (Fig. 4a, b).
Hence, unlike other characterized secondary transporters, glutamate
transporters contain not one but two inverted structural
repeats (Fig. 4d). Because of the N-terminal symmetry, the interaction
interface of the trimerization domain can be partitioned into struc-
turally related extracellular and cytoplasmic portions (Fig. 4c), which
contribute approximately equally to the total buried surface area.


Hinge movements 


Within the transport domain, structurally symmetrical TM3 and
TM6 serve as two arms holding the transporter core and extending
from the trimerization domain: the former from the cytoplasm and
the latter from the extracellular solution (Fig. 4e, f). The loops, which
connect these transmembrane segments to the trimerization domain,
enable the inward movement. The long flexible loop between TM3
and TM4 is poorly structured in the L-Asp-bound wild-type GltPh
crystals, and in GltPh(55C/364CHg) its structure is largely determined
by crystal contacts. In contrast, the structurally symmetrical loops
between TM2 and TM3 and between TM5 and TM6 are short and
undergo well-defined transitions to accommodate the movements of
the transport domain (Fig. 4e, f and Supplementary Movie). In
GltPh(55C/364CHg), unwinding of the TM3 α-helix by approximately
one turn extends the TM2-TM3 loop by four residues and allows for
the descent of TM3 towards the cytoplasm. In contrast, the TM5-
TM6 loop is shortened by two residues as a result of simultaneous
unwinding of TM5 by one-half of a turn and extension of TM6 by a
turn of a helix. Both loops contain conserved glycines (G69, G79 and
G82 in the TM2-TM3 loop and G221 in the TM5-TM6 loop), which
may facilitate the folding/unfolding of the helices and serve as hinges.
Indeed, the transport domain movement can be decomposed into a
16 Å descent towards the cytoplasm followed by a 37° rotation
around an axis passing simultaneously through the transport domain
centre of mass and loops TM2-TM3 and TM5-TM6. Consistent with
the importance of the structural re-arrangements in the loop regions,
chemical modifications of cysteines in the TM5-TM6 loop of EAAT3
inhibited uptake.

Figure 3 | Domain interaction surfaces. a, b, Trimerization domains of
wild-type GltPh (a) and GltPh(55C/364CHg) (b) are shown in surface
representation and coloured wheat. Residues involved in domain contacts,
identified by ProFace server, are coloured blue (TM1 and TM2) and green
(TM4 and TM5). Interacting structural elements of the transport domain
are shown in ribbon representation: HP1 (yellow) and TM7 (orange) in
wild-type GltPh; HP2 (red) and TM8 (pink) in GltPh(55C/364CHg).
c, d, Surface representation of the trimerization (c) and transport
(d) domains coloured according to evolutionary conservation. Dark
blue and red correspond to the highly conserved and variable
residues, respectively. The interacting surfaces are facing the
viewer, and the white arrows mark the highly conserved serine-rich
signature motif in HP1. Conservation scores were calculated using the
Consurf server, and 212 SLC1 sequences with less than 60% identity were
harvested from the Pfam database and aligned in ClustalW2.




Transport mechanism 


In the structure of wild-type GltPh, L-Asp is bound ~5 Å beneath the
surface of the extracellular bowl, occluded from the solution by HP2.
In contrast, in the structure of GltPh(55C/364CHg), L-Asp is bound
~5 Å beneath the intracellular surface, occluded from the solution by
HP1. Hence, we propose that the structure of L-Asp-bound GltPh
corresponds to the state right after binding of the substrate from the
extracellular side (outward-facing occluded) and the GltPh(55C/
364CHg) structure corresponds to the state right before the release
of the substrate into the cytoplasm (inward-facing occluded)
(Fig. 5). The isomerization between these states involves a combina-
tion of a cross-membrane movement and rotation of the transport
domain, composed of the substrate-binding transporter core and
peripheral TM3 and TM6. During this movement, the lipid-facing
hydrophobic transmembrane segments, TM3 and TM6, traverse the
bilayer directly, moving towards the cytoplasm by ~12 Å. In contrast,
passage of the relatively polar HP1 and HP2, with their tips crossing as
much as 20 Å of the lipid bilayer, is facilitated by the intra-protein
track provided by the trimerization domain. The transport domains
are expected to move stochastically and independently within each
protomer, so that the crystal structure represents a statistically rare
case, when all three transport domains are in the inward orientation.
The rigid trimerization domain is anchored in the membrane and
provides a counterbalance to the movements of the bulky transport
domains, suggesting an explanation for the obligatory oligomeric
assembly of the SLC1 family of transporters.
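
The combination of cross-membrane movement and rotation can be pictured as a rigid-body transform: a translation followed by a rotation about an axis through the moved centre of mass. The sketch below applies the 16 Å descent and 37° rotation quoted in the text to toy points. The coordinate frame (z as the membrane normal, cytoplasm towards −z), the axis direction and the point coordinates are all assumptions made for illustration.

```python
import math


def rotation_matrix(axis, angle_deg):
    """Rodrigues rotation matrix for a rotation about `axis` by `angle_deg`."""
    ux, uy, uz = axis
    norm = math.sqrt(ux * ux + uy * uy + uz * uz)
    ux, uy, uz = ux / norm, uy / norm, uz / norm
    theta = math.radians(angle_deg)
    c, s = math.cos(theta), math.sin(theta)
    k = 1.0 - c
    return [
        [c + ux * ux * k, ux * uy * k - uz * s, ux * uz * k + uy * s],
        [uy * ux * k + uz * s, c + uy * uy * k, uy * uz * k - ux * s],
        [uz * ux * k - uy * s, uz * uy * k + ux * s, c + uz * uz * k],
    ]


def rotation_angle_deg(rotation):
    """Recover the rotation angle from the trace of a rotation matrix."""
    trace = rotation[0][0] + rotation[1][1] + rotation[2][2]
    cos_theta = max(-1.0, min(1.0, (trace - 1.0) / 2.0))
    return math.degrees(math.acos(cos_theta))


def move_point(point, centre, shift, rotation):
    """Translate by `shift`, then rotate about an axis through the moved centre."""
    moved_centre = [c + s for c, s in zip(centre, shift)]
    rel = [p + s - mc for p, s, mc in zip(point, shift, moved_centre)]
    rotated = [sum(rotation[i][j] * rel[j] for j in range(3)) for i in range(3)]
    return [r + mc for r, mc in zip(rotated, moved_centre)]


# Assumed frame: z is the membrane normal, cytoplasm lies towards -z.
centre_of_mass = [0.0, 0.0, 0.0]    # hypothetical transport-domain centre
descent = [0.0, 0.0, -16.0]         # 16 angstrom descent from the text
hinge_axis = (1.0, 0.0, 0.0)        # hypothetical in-plane axis direction
R = rotation_matrix(hinge_axis, 37.0)

binding_site = [0.0, 5.0, 3.0]      # hypothetical point in the domain
new_centre = move_point(centre_of_mass, centre_of_mass, descent, R)
new_site = move_point(binding_site, centre_of_mass, descent, R)
recovered_angle = rotation_angle_deg(R)

print(new_centre, new_site, round(recovered_angle, 1))
```

Because the rotation axis passes through the centre of mass, the centre itself moves only by the translation, while points away from the axis travel further or less far depending on their position.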

Substrate binding/unbinding on either side of the membrane is
associated with additional conformational changes, or gate openings.
The crystallographic data and molecular dynamic simulations
suggest that substrate and ion dissociation on the extracellular side is
associated with the opening of HP2, defining it as an extracellular
gate. We hypothesize that in a functionally symmetrical manner,
movements of HP1 allow for dissociation of the substrate and ions
from the inward facing state, defining it as a bona fide intracellular
gate. The alternate opening of the extracellular and intracellular gates

Figure 4 | Amino-terminal inverted structural repeat. a, b, Cartoon
representation of TM1-TM3 (blue) and TM4-TM6 (green) in the
GltPh(55C/364CHg) viewed in the membrane plane (a), and their structural
superposition (b). c, Symmetrical helices, TM1-TM2 and TM4-TM5, form
the interaction surface within the transport domain, which is partitioned
into intracellular and extracellular halves delineated by the dotted line.

d, Schematic representation of GltPh trimerization (orange) and the
transport (light blue) domains. Two inverted structural repeats are
emphasized by blue and green and yellow and red trapezoids. e, f, Structure
of TM2-TM3 and TM5-TM6 loops in wild-type GltPh (e) and GltPh(55C/
364CHg) (f). TM2-TM6 are shown in cartoon representation with TM4
omitted for clarity. The transporter core is shown in transparent surface
representation. Bound L-Asp and conserved glycines are shown as spheres.


is strictly maintained: in the outward facing state, when HP2 opens to
expose the substrate and ion binding sites to the extracellular solu-
tion, HP1 is secured in the closed state, packed against TM2 and
TM5. Conversely, in the inward facing state HP2, the extracellular
gate, is locked closed upon displacing HP1.
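
The alternating-gate rule can be written down as a small consistency check. The sketch below is a toy encoding of the proposed scheme, not a model taken from the paper: the four state names, the rule that the transport domain moves only between occluded states, and the treatment of HP1 as the intracellular gate (which the text presents as a hypothesis) are all assumptions of the example.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class State:
    name: str
    facing: str      # "outward" or "inward"
    hp1_open: bool   # proposed intracellular gate
    hp2_open: bool   # extracellular gate


OUTWARD_OPEN = State("outward open", "outward", False, True)
OUTWARD_OCCLUDED = State("outward occluded", "outward", False, False)
INWARD_OCCLUDED = State("inward occluded", "inward", False, False)
INWARD_OPEN = State("inward open (hypothetical)", "inward", True, False)
CYCLE = [OUTWARD_OPEN, OUTWARD_OCCLUDED, INWARD_OCCLUDED, INWARD_OPEN]


def is_occluded(state):
    return not state.hp1_open and not state.hp2_open


def is_allowed_state(state):
    """Only the gate facing the accessible side may be open."""
    if state.facing == "outward":
        return not state.hp1_open
    return not state.hp2_open


def is_allowed_step(a, b):
    """Domain movement needs both gates closed; otherwise exactly one gate moves."""
    if not (is_allowed_state(a) and is_allowed_state(b)):
        return False
    if a.facing != b.facing:
        return is_occluded(a) and is_occluded(b)
    return (a.hp1_open != b.hp1_open) != (a.hp2_open != b.hp2_open)


steps_ok = all(is_allowed_step(a, b) for a, b in zip(CYCLE, CYCLE[1:]))
leak_ok = is_allowed_step(OUTWARD_OPEN, INWARD_OPEN)
print(steps_ok, leak_ok)
```

Under these rules a state with both gates open is rejected, and so is a direct jump between the two open states, which is what prevents an uncoupled leak in this toy picture.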

As has been suggested previously, coupling of the substrate trans-
port to the energy of the ionic gradients is established via synergistic

Figure 5 | Schematic transport mechanism. Shown is a single transporter
protomer. Substrate and sodium binding to the outward and inward facing
states is coupled to the closure of the extracellular and intracellular gates,
HP2 (red) and HP1 (yellow), respectively. Isomerization between the

binding of substrate and sodium ions on the extracellular and intra-
cellular sides of the membrane coupled to the closure of the corres-
ponding gates. The steep sodium dependence of the substrate binding,
observed both in wild-type GltPh (ref. 9) and in GltPh(55C/364CHg)
(Fig. 2d), is responsible for the differential affinity for the substrate on
the extracellular side of the membrane, where sodium concentration is
high, and the intracellular side, where it is low. We further propose
that the isomerization between the outward and inward facing states is
sodium independent and may simply be driven by thermal energy
alone. The observation that Hg2+-mediated crosslinking of the K55C/
A364C mutant is completed within seconds both in membranes and
in detergent suggests that this transition is rapid and does not limit the
rate of transport in GltPh, which has a turnover time of ~3 min at
ambient temperatures. Finally, we hypothesize that a similar isomeri-
zation reaction may also occur in the apo-transporter to complete the
transport cycle. The structural re-arrangements within the transport
domain, which would allow for the closure of the extracellular and the
intracellular gates in the absence of bound substrate and ions, remain
to be elucidated.

Note added in proof. During the review of this paper, we became
aware of a modelling study of the inward facing state based on
symmetry considerations. It recapitulates several features of the
crystal structure, and in particular proposes that HP2 packs against
TM2 and TM5.


METHODS SUMMARY 

Cysteine crosslinking. GltPh(K55C/C321A/A364C) was expressed as a His8
fusion protein and purified as described previously. Transporter samples were
exchanged by size-exclusion chromatography into buffer containing 10 mM
HEPES/NaOH or KOH, pH 7.4, 1 mM n-dodecyl-β-D-maltopyranoside and
either 100 mM NaCl and 100 µM L-Asp or 100 mM KCl. Crosslinking was
initiated by addition of 1:2 molar ratio of Cu2+ and 1,10-phenanthroline or
HgCl2 at indicated concentrations. Reactions were quenched with 100 mM
N-ethyl maleimide before SDS-polyacrylamide gel electrophoresis analysis.
Crude E. coli membranes were isolated by centrifugation, washed either in a
Na+/L-Asp-containing or free buffer and crosslinked as in detergent. Protein
bands were visualized by western blotting using antibodies against histidine tag.
Crystallography. The K55C/C321A/A364C mutation was introduced within a
heptahistidine mutant of GltPh, used in earlier crystallographic studies, to
which we refer as 'wild type' throughout the text for brevity. Purified protein
was crosslinked in the presence of 10-fold molar excess of HgCl2, dialysed against
buffer containing 10 mM HEPES/NaOH, 7 mM n-decyl-β-D-maltopyranoside,
100 mM NaCl and 100 µM L-Asp, diluted to the final concentration of
2-4 mg ml⁻¹ and supplemented with 0.5 mM E. coli total polar lipid extract
and 100 mM NaBr. Protein solution was mixed at 1:1 (vol.:vol.) ratio with the
reservoir solution, containing 100 mM MES, pH 5.0, 18-20% PEG 350 MME
and 200 mM CaCl2, and crystallized at 4 °C by hanging-drop vapour diffusion.
Crystals were cryoprotected by allowing the drop to dry until its volume was
reduced by 50%. SeMet substituted protein was expressed as described previ-
ously and crystallized as above. Diffraction data were indexed, integrated and
scaled using the HKL-2000 package. Further analyses were performed using
CCP4 programs. Initial phases were obtained using Phaser, and the protein
model built manually in Coot and refined using REFMAC with TLS and
three-fold NCS restraints.

outward and inward facing occluded states occurs upon movement of the
transport domain (blue), relative to the trimerization domain (grey). The
inward facing open state has not been structurally characterized and is
hypothetical.


The SUMO modification pathway is involved in the BRCA1 response to
genotoxic stress


Joanna R. Morris, Chris Boutell, Melanie Keppler, Ruth Densham, Daniel Weekes, Amin Alamshah,
Laura Butler, Yaron Galanty, Laurent Pangon, Tai Kiuchi, Tony Ng & Ellen Solomon


Mutations in BRCA1 are associated with a high risk of breast and ovarian cancer. BRCA1 participates in the DNA damage 
response and acts as a ubiquitin ligase. However, its regulation remains poorly understood. Here we report that BRCA1 is 
modified by small ubiquitin-like modifier (SUMO) in response to genotoxic stress, and co-localizes at sites of DNA damage 
with SUMO1, SUMO2/3 and the SUMO-conjugating enzyme Ubc9. PIAS SUMO E3 ligases co-localize with and modulate 
SUMO modification of BRCA1, and are required for BRCA1 ubiquitin ligase activity in cells. In vitro SUMO modification of the 
BRCA1/BARD1 heterodimer greatly increases its ligase activity, identifying it as a SUMO-regulated ubiquitin ligase (SRUbL). 
Further, PIAS SUMO ligases are required for complete accumulation of double-stranded DNA (dsDNA) damage-repair 
proteins subsequent to RNF8 accrual, and for proficient double-strand break repair. These data demonstrate that the 
SUMOylation pathway plays a significant role in mammalian DNA damage response. 


The amino (N) terminus of BRCA1 has a RING domain that interacts
with ubiquitin-conjugating enzymes and is required for its ubiquitin
ligase activity. Many disease-causing mutations are found within
this region, and loss of the ligase activity is associated with suscepti-
bility to breast cancer. We have previously shown that BRCA1-
dependent ubiquitin conjugates are generated at sites of DNA
damage repair in human cells. Although the substrate(s) of the
BRCA1 ubiquitin ligase activity remains controversial, and its role
at this location unclear, the ligase activity itself is highly conserved
and damage-associated ubiquitin conjugates are also formed by
Caenorhabditis elegans and Gallus gallus homologues of BRCA1.

BRCA1 recruitment to chromatin at sites of DNA damage occurs
through a complex cascade of protein modifications and interac-
tions, and BRCA1 is the third in a sequence of ubiquitin ligases
recruited to such sites. Binding of the mediator of DNA damage
checkpoint 1 (MDC1) protein to the phosphorylated tail of histone
H2AX (γ-H2AX) at sites of DNA breakage recruits the ubiquitin
ligase RNF8, which generates ubiquitin chains bound by
RAP80:ABRA1, which in turn recruits BRCA1 through its carboxy
(C) terminus. The activity of the second ubiquitin ligase,
RNF168, maintains the ubiquitin chain signal initiated by RNF8
and thus helps retain BRCA1 at these sites.

SUMOylation of substrates is catalysed by a cascade of enzymes:
the activities of the E1 SUMO-activating enzyme (SAE1/SAE2), the
E2-conjugating enzyme (Ubc9) and E3 SUMO ligases result in an
isopeptide bond between the target lysine and the activated SUMO
carboxyl terminus (reviewed in ref. 22). In vertebrates, three SUMO
isoforms, SUMO1, SUMO2 and SUMO3, are expressed. SUMO2 and
SUMO3 differ by three N-terminal residues, and form a distinct
subfamily known as SUMO2/3. Ubc9 and SUMO E3 enzymes have
been implicated in the DNA damage response in human cells and
animal models. In C. elegans, ce-ubc9 interacts with ce-Bard1, the
N-terminal binding partner of ce-Brca1. The interaction of BRCA1
with free, non-conjugated SUMO1 has been shown to decrease its
transcriptional activity. Here we investigated the previously un-
addressed role of the SUMO pathway in the regulation of BRCA1
and in the DNA damage response.


SUMO conjugation in response to damage 

We noted that after treatments with genotoxic agents (irradiation,
cisplatin and hydroxyurea), SUMO isoforms SUMO1 and SUMO2/3
localized to sub-nuclear damage repair foci marked with γ-H2AX
and BRCA1 (Fig. 1a-c, Supplementary Fig. 2a and data not shown).
We also saw increased fluorescence resonance energy transfer (FRET)
between green fluorescent protein (GFP)-BRCA1 and red fluorescent
protein (RFP)-SUMO1, measured by fluorescence lifetime
imaging microscopy (FLIM), after treatment of cells with the geno-
toxic agents cisplatin, irradiation, hydroxyurea and epirubicin
(Supplementary Fig. 2b, c), indicating increased protein-protein
interaction. Increased FRET populations were observed in
regions largely coincident with γ-H2AX foci in hydroxyurea-treated
cells, indicating BRCA1-SUMO1 interaction at or close to γ-H2AX-
decorated chromatin (Fig. 1d). The interaction was dependent on
conjugation, as a SUMO mutant with a C-terminal di-glycine substi-
tution (GG to GA) exhibited lower FRET with GFP-BRCA1 (Fig. 1e, f).
Consistent with this, immunoprecipitation of endogenous BRCA1
from hydroxyurea-treated cells co-purified high molecular mass endo-
genous SUMO conjugates (Supplementary Fig. 2d), suggesting either
SUMOylation of BRCA1 itself or interaction of BRCA1 with large,
SUMO-modified proteins. De novo SUMOylation at sites of DNA
damage would require localization of the SUMO-conjugating enzyme,
Ubc9, to these sites. We found that Ubc9-GFP co-localized both with

Figure 1 | The SUMO-conjugation machinery locates to sites of DNA
damage, and BRCA1 is modified by SUMO after genotoxic stress. COS-7
cells with (HU) or without (Unt) hydroxyurea treatment stained with anti-
SUMO isoforms and γ-H2AX (a) or BRCA1 and counted in b and c. Error
bars, s.d.; n > 30 cells per condition. Scale bars, 10 µm throughout the
figures. d, BRCA1-SUMO1 interaction at sites of genotoxic stress. Images of
γ-H2AX, GFP and RFP multiphoton intensity and FLIM in transfected cells
with or without hydroxyurea. FRET shortens the GFP-fluorescence lifetime
in orange to red. e, Myc-SUMO1-GA conjugates poorly compared with wild-
type myc-SUMO1 in denaturing SDS-polyacrylamide gel electrophoresis
immunoblotted with anti-myc. f, FRET efficiency of RFP-SUMO1-GA or
RFP-SUMO1 with GFP-BRCA1 in treated cells. Bars, s.e.m. g, SUMOylation
of BRCA1. Nickel precipitation (Pr, Ni2+) from untransfected (HeLa) and
6×His-SUMO-expressing cells treated with the agents shown.


γ-H2AX and BRCA1 in treated cells and showed increased FRET with
RFP-BRCA1 (Supplementary Fig. 2e, f).
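
Because FLIM reads out FRET as a shortening of the donor fluorescence lifetime, a FRET efficiency can be derived from two lifetimes with the standard relation E = 1 − τ_DA/τ_D. The sketch below illustrates that relation only. The lifetimes are hypothetical, and the text here does not state exactly how the reported efficiencies were computed.

```python
def fret_efficiency(tau_donor_alone_ns, tau_donor_with_acceptor_ns):
    """FRET efficiency from donor lifetimes: E = 1 - tau_DA / tau_D."""
    if tau_donor_alone_ns <= 0:
        raise ValueError("donor-alone lifetime must be positive")
    return 1.0 - tau_donor_with_acceptor_ns / tau_donor_alone_ns


# Hypothetical GFP lifetimes in nanoseconds, for illustration only.
tau_gfp_alone = 2.4
tau_gfp_with_rfp = 2.1

efficiency = fret_efficiency(tau_gfp_alone, tau_gfp_with_rfp)
print(f"FRET efficiency: {efficiency:.3f}")
```

A shorter donor lifetime in the presence of the acceptor gives a higher efficiency, which is why increased FRET is read as closer association of the two tagged proteins.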

These data are consistent with modification of BRCA1 and/or 
closely associated proteins. As SUMO proteins are covalently linked 
to target proteins, we purified hexahistidine (6X His)-tagged SUMO 
conjugates using nickel-charged beads in highly denaturing condi- 
tions, and probed for BRCA1. BRCA1 was enriched on nickel after 
cell treatment with hydroxyurea, cisplatin and heat shock but was 




absent from untreated or untransfected cells, or cells treated with
hydroxyurea and then low concentration H2O2 (Fig. 1g). (H2O2
cross-links Ubc9 and SAE2, preventing SUMO conjugation in HeLa
cells, indicating that, like most known SUMO targets, the modifica-
tion of BRCA1 is rapidly processed.) Both SUMO isoforms conjugate
to BRCA1, but more BRCA1 was purified with SUMO2 than SUMO1,
particularly after cisplatin treatment. The effect of genotoxic agents on
total cellular SUMO conjugates is shown in Supplementary Fig. 2g.

Together, these data indicate that the SUMO conjugation pathway
forms part of the mammalian response to DNA damage, as Ubc9 and
the SUMO proteins interact with at least one DNA damage-regulated
SUMOylation target, BRCA1, at sites of genotoxic stress labelled by
γ-H2AX.


PIAS SUMO ligases in the damage response 

Members of the protein inhibitor of activated signal transducer and 
activator of transcription (PIAS) family of SUMO E3 ligases and the 
Mms21(NSE2) SUMO ligase are found in foci within the nucleus**, and 
have been reported to play a role in the DNA damage response**”’. 
Short interfering RNA (siRNA) depletion of PIAS1 and PIAS4 impaired
6×His-SUMO1 and SUMO2 modification of endogenous BRCA1 in
hydroxyurea-treated cells, unlike depletion of MMS21, PIAS2 or PIAS3
(Fig. 2a and data not shown). In untreated cells, depletion of PIAS1
resulted in increased SUMO2-conjugated BRCA1; this was dependent
on PIAS4 because depletion of PIAS1 and PIAS4 inhibited the
modification (Fig. 2a). Quantitative PCR with reverse transcription
(RT-PCR) showed that siRNAs to each SUMO ligase were E3 specific, and
ligase depletion had no impact on steady-state BRCA1 protein levels
(Supplementary Fig. 3a, b). Ectopic expression of PIAS1 and PIAS4 (but
not MMS21) increased BRCA1–SUMO interaction, as measured by
FLIM (Fig. 2b) and 6×His-SUMO-conjugated BRCA1 (Fig. 2c).
Increased expression or depletion of PIAS ligases had no obvious 
impact on total SUMO1 conjugates, but SUMO2 conjugates (in 
6XHis-SUMO2 cells) were decreased on PIAS1 depletion, and 
increased after ectopic expression of PIAS1 and PIAS4 (Fig. 2a and
Supplementary Fig. 3c). Thus BRCA1 is one of many substrates
modified by SUMO2 in response to PIAS activity but is one of a smaller
population of SUMO1 substrates regulated by PIAS proteins.

Figure 2 | PIAS E3 SUMO ligases modulate BRCA1 SUMOylation. a, PIAS
ligases are required for BRCA1 SUMOylation. Nickel elutant from untreated
or hydroxyurea-treated 6×His-SUMO-expressing cells transfected with
siRNA to transcripts shown (Non-T, non-targeting) and probed with
antibodies shown. SUMO conj, SUMO conjugates. b, PIAS ligases increase
BRCA1–SUMO1 interaction. FRET efficiency of RFP–SUMO1 with
GFP–BRCA1 in untreated (Unt) cells or transfected with SUMO ligases
shown, or cells treated with hydroxyurea. Bars, s.e.m. c, PIAS ligases increase
BRCA1 SUMOylation. Nickel elutant from 6×His-SUMO-expressing cells
transfected with PIAS expression constructs.

We assessed the impact of SUMO-ligase depletion on BRCA1
localization in cells and found that loss of PIAS1 and PIAS4, but not
MMS21, PIAS2 or PIAS3 SUMO ligases, reduced its ability to localize to
γ-H2AX in hydroxyurea-treated cells (Fig. 3a, b and data not shown).
The introduction of siRNA-resistant PIAS proteins at low expression
levels restored the ability of endogenous BRCA1 to co-localize with
γ-H2AX (Supplementary Fig. 4a). PIAS1, PIAS4 and MMS21 ligases
expressed in cells co-localized with BRCA1. High expression of PIAS1
or PIAS4, but not MMS21, together with BRCA1 and SUMO1 or
SUMO2, resulted in exaggerated GFP–BRCA1 foci (Supplementary
Fig. 4c). The formation of exaggerated foci was dependent on the
SUMO ligase activity, because an inactive PIAS1 RING mutant”
inhibited the ability of PIAS1 both to induce increased BRCA1–SUMO1
interaction in cells (Supplementary Fig. 4b) and to cause exaggerated
BRCA1 foci (Supplementary Fig. 4c, d). Together, these data indicate
that PIAS1 and PIAS4 SUMO ligases modulate and are required for
normal accumulation of BRCA1 at sites of genotoxic stress.

Figure 3 | PIAS E3 SUMO ligases influence accumulation of DNA
damage-repair protein and are required for dsDNA break repair. a, BRCA1
accumulation to γ-H2AX in hydroxyurea-treated cells transfected with siRNAs
indicated and counted in b (bars, s.d.; n > 30 cells per condition). c, Hydroxyurea
and siRNA treated cells scored for γ-H2AX foci with accumulated dsDNA
damage proteins (bars, s.d.; n > 30 cells per condition) and representative
immunofluorescence images shown in d. e, Homologous recombination (HR)
and non-homologous end joining (NHEJ) assayed in siRNA transfected cells
bearing integrated gene conversion and end-joining substrates after
I-SceI-induced dsDNA break. Bars, s.d. f, Colony survival of siRNA-transfected cells
exposed to cisplatin. Combined sensitivity of PIAS1 + PIAS4 was characterized
as additive by the Bliss independence model”.

To examine whether impaired BRCA1 accumulation following 
PIAS depletion is likely to be direct or indirect, we investigated the 
integrity of the upstream accumulation cascade by examining RAP80, 
K63-linked ubiquitin (K63-Ub), RNF168 and RNF8, all proteins 
necessary for BRCA1 recruitment to sites of genotoxic stress‘? '*1°"871, 
PIAS4 was required for normal accumulation of proteins subsequent 
to RNF8, affecting RNF168, K63-Ub, RAP80 and BRCA1, whereas 
PIAS1 was required for complete accumulation of proteins subsequent 
to the generation of K63-linked ubiquitin, affecting RAP80 and BRCA1 
(Fig. 3c, d and Supplementary Fig. 4e). Thus, although BRCA1 is
SUMO modified in a PIAS-dependent manner, these data indicate that 
its accumulation is regulated through an indirect mechanism involving 
earlier-arriving proteins. 

Consistent with the requirement for PIAS proteins in BRCA1
accumulation, depletion of these ligases, like depletion of BRCA1,
reduced homologous recombination and non-homologous end- 
joining repair of double-strand breaks, and increased cellular sensi- 
tivity to cisplatin (Fig. 3e, f), indicating that these SUMO ligases are 
required for the full response to DNA damage. 


SUMO conjugation influences BRCA1 ligase 


In cells, ubiquitin conjugates detected by the monoclonal antibody 
FK2 are lost from sites of DNA damage after depletion of many 
proteins involved in accumulation of dsDNA repair protein,
including the ubiquitin ligases RNF8, RNF168 and BRCA1 (refs 5, 8-10, 21,
38). FK2-ubiquitin accumulation to γ-H2AX was also reduced after
depletion of PIAS1 and PIAS4 (data not shown). Because PIAS1
depletion does not impair RNF168/K63-Ub accumulation (Fig. 3c),
this suggests that the impact on FK2-ubiquitin is independent of or
subsequent to RNF8/RNF168 activity. FK2-ubiquitin accumulation
was also reduced in the small population of PIAS-depleted cells that
retained some BRCA1 foci (Fig. 4a, b), suggesting disruption of an
activity subsequent to BRCA1 accumulation.

Ectopic expression of BRCA1/BARD1 in cells is able to increase 
FK2-ubiquitin conjugate staining above levels detected in surround- 
ing, non-transfected cells, in a manner dependent on a functional 
RING domain’. This ability was absent in PIAS1/4-depleted cells 
(Supplementary Fig. 5a), indicating that the BRCA1 ubiquitin ligase 
activity is reduced. Similarly the co-localization of K6-linked
ubiquitin (catalysed by BRCA1 (refs 5, 39, 40)) with γ-H2AX was impaired
in BRCA1-, PIAS1- or PIAS4-depleted cells, consistent with loss of
BRCA1 ligase activity (Supplementary Fig. 5b).

SUMO conjugation frequently, but not always, occurs on lysines in
the consensus motif, ‘ΨKxE’, where Ψ is an aliphatic residue. The
two highest-scoring motifs in BRCA1 are residues K109 and K119,
adjacent to the RING domain (Fig. 4c). The interaction of BRCA1
K109→R with RFP–SUMO1 was comparable to that of wild-type
BRCA1 (data not shown). However, substitutions of the K119 motif
(K119→R or E121→A) reduced interaction with RFP–SUMO1
(Fig. 4d). These mutations, like the RING mutation, C61G, reduced
the ability of ectopic BRCA1 to induce increased levels of
co-localizing FK2 ubiquitin conjugates in cells (Fig. 4e, f), consistent with the
effects of PIAS1 and PIAS4 ligase depletion.

These observations imply that the SUMOylation pathway acts 
directly on the BRCA1/BARD1 ligase. To test this, we explored 
BRCA1 SUMO modification in vitro (Supplementary Fig. 5c, d). 
Titration of unmodified against SUMO1-modified heterodimer 
showed that the modified form had increased ubiquitin ligase activity, 
generating 10-20 times more conjugated ubiquitin (Fig. 4g).

Figure 4 | The SUMO pathway regulates BRCA1 ubiquitin ligase activity.

a, FK2-ubiquitin accumulation with BRCA1 in siRNA and hydroxyurea-treated
cells and counted in b (bars, s.d.; n > 30 cells per condition). c, Illustration of
BRCA1 motifs and consensus SUMO sites (K) identified by both
abgent.com/doc/sumoplot and bioinformatics.lcd-ustc.org/sumosp programs; K119 and
K109 rank highest in both. d, BRCA1 K119 consensus is required for interaction
with SUMO1 in hydroxyurea-treated cells. Bars, s.e.m. e, Cells expressing SUMO
consensus or RING-C61G mutants stained with low titration FK2 and scored in
f (bars, s.d.; n > 30 cells per condition). WT, wild type. g, h, SUMO-modification
increases the ubiquitin ligase activity of BRCA1)_147-BARD 116142. BR, BRCA1; 
BD, BARD1; SU1, SUMO1; SU2, SUMO2. Ubmix indicates ubiquitin 
conjugation components except the heterodimer. 


Comparison of unmodified heterodimer with SUMO1 and SUMO2 
modified forms, at a concentration where the unmodified proteins 
had little detectable activity, showed that the SUMO modification
enhanced ubiquitin ligase activity independent of isoform specificity
(Fig. 4h). 


Discussion 


SUMO modification increases BRCA1 ubiquitin ligase activity in 
vitro, consistent with the requirement in cells for PIAS SUMO E3 
ligases and for an N-terminal SUMO modification consensus site, 
thus identifying BRCA1 as a SUMO-regulated ubiquitin ligase 
(SRUbL). SUMO modification of BRCA1 and occupation of the
BRCA1 RING by ubiquitin E2 conjugating enzymes are concurrent,
supported by the observation that the presence of several-fold molar
excess of the ubiquitin E2, UbcH5a, has little impact on BRCA1
SUMO modification in vitro (data not shown). Similarly, mutations
that inhibit BRCA1-ubiquitin E2 interactions (T77→M, I26→A,
C61→G or the absence of BARD1 polypeptide) had no impact on
BRCA1 SUMOylation, indicating that the BRCA1/BARD1
heterodimer RING domains are not required for SUMO pathway interaction
(data not shown). Thus the simplest mechanism envisaged is that
SUMO modification of BRCA1 increases the E3–E2 interface,
through SUMO interaction with the E2 enzyme (possibly through 
SUMO interacting motifs). Based on in vitro investigations, other 
authors*’* have shown that auto-ubiquitylation of BRCA1 at posi- 
tions C-terminal to its RING domain regulates ligase activity and E2 
choice, although how BRCA1 SUMOylation and auto-ubiquitylation 
relate is yet to be clarified. 

These data show that the PIAS SUMO ligases are necessary compo- 
nents of the mammalian response to double-strand breaks, required 
for homologous recombination and non-homologous end-joining, 
and that they influence BRCA1 accumulation through earlier-arriving 
proteins. However, the details of their regulation by the SUMO path- 
way remain to be determined. It is possible, for example, that like 
BRCA1, the other ubiquitin ligases in the pathway, RNF8 and
RNF168, are also SRUbLs regulated by SUMO-modification (Sup- 
plementary Fig. 1). 

The known post-translational modifications of BRCA1 now 
include phosphorylation, ubiquitylation and SUMOylation. 
Because the two features of BRCA1 activity regulated by the
SUMO pathway, ubiquitin ligase activity and accumulation at sites 
of DNA damage, are also inhibited by some BRCA1 mutations that 
predispose to breast and ovarian cancer**’, it seems highly likely that 
the SUMO pathway will be of relevance to cancer predisposition and 
development. 


METHODS SUMMARY 

Cell treatments. Cells were treated with 10 µM cisplatin for 3 h followed by 16 h
recovery, 16 h in 20 nM epirubicin, 3 mM hydroxyurea, or 3 mM hydroxyurea
for 8 h followed by 1 mM H₂O₂ for 15 min, 15 min heat shock at 43 °C or
exposure to 10 Gy irradiation using a caesium-137 source.

siRNA. SMART Pool siRNAs (Dharmacon) used were: BRCA1 (L-003461-00),
PIAS1 (L-008167-00), PIAS2 (L-009428-00), PIAS3 (L-004164-00), PIAS4
(L-006445-00), MMS21/NSE2 (L-018070-00), RNF168 (L007152-00-0005) and
RNF8 (L-009600-00-0005), and a non-targeting control siRNA confirmed to have
minimal targeting of known genes (D-001810-10-05). The untranslated-region
target sequences were as follows: GGCGAAGUUCACUGCGC (PIAS1),
CAGAGGGAGGAGUGACC (PIAS4).

Purification of 6×His-tagged SUMO conjugates. This was as described previously**.
Immunofluorescence microscopy. This was performed as previously described’.
Antibodies. The following antibodies were used: MS110 (Ab1, Calbiochem),
FK-2 (Biomol), anti-Flag (M2) (Sigma), control rabbit IgG (Sigma), β-actin
(Abcam), anti-c-myc, anti-SUMO1, anti-SUMO2/3 (Santa Cruz), γ-H2AX
(Millipore and Abcam), anti-RNF8 (Abnova), anti-RNF168 (ref. 10), anti-RAP80
(Bethyl) and anti-K63-ubiquitin (Millipore).

Time-resolved multiphoton microscopy. Measurements were undertaken with
a modified system similar to that described previously”. Fluorescence lifetime
imaging used time-correlated single-photon counting electronics (Becker &
Hickl, SPC 830) collected through a bandpass filter centred at
λ = 510 ± 10 nm (Chroma). Excitation power was adjusted using a
neutral-density filter to photon counting rates ~10⁴–10⁵ photons s⁻¹, and acquisition times
~300 s at low excitation power were used. Imaging control and analysis used
custom software (CVI LabWindows)**. The Förster radius of the GFP and RFP
pair used has been calculated as 4.7 nm (ref. 31).

Repair assays. Homologous recombination and non-homologous end-joining
cell assays were performed as previously described*’.

Protein production and ubiquitin ligase assays. These were as previously 
described’. 

In vitro SUMO conjugation assays. These were as previously described”. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 


Received 17 May; accepted 19 October 2009.

METHODS

Plasmid constructs. Full-length BRCA1, SUMO1 and SUMO1-GA were cloned 3′
of the modified RFP of pcDNA3.1 (ref. 31) or in myc-pcDNA3.1, and full-length
BRCA1 and UBC9 were cloned 3′ of the GFP in p-EGFP (Clontech).
Flag–HIS-MMS21 complementary DNA was cloned into pCL-NCX previously modified to
contain 3Flag–HIS tag, and PIAS4 in 3×Flag–Stag in pcDNA3.1(−) modified
to contain 3Flag–Stag. PIAS1 expression constructs were gifts (see
Acknowledgements). Point mutations were generated using site-directed mutagenesis and
confirmed by sequencing. Ubiquitin and BARD1 clones in pcDNA3.1 have been
previously described’.

Cell treatments. Cells were treated with 3 mM hydroxyurea for 8 h, 0.9% NaCl
carrier, 10 µM cisplatin for 3 h followed by 16 h recovery, or 16 h in 20 nM
epirubicin, 3 mM hydroxyurea for 8 h followed by 1 mM H₂O₂ for 15 min or
15 min heat shock at 43 °C or cells were exposed to 10 Gy irradiation using a
Gammacell 1000 Elite irradiator (caesium-137 source).

siRNA. On-target Plus SMART Pool siRNAs (Dharmacon) used were: BRCA1
(L-003461-00), PIAS1 (L-008167-00), PIAS2 (L-009428-00), PIAS3
(L-004164-00), PIAS4 (L-006445-00), MMS21/NSE2 (L-018070-00), RNF168
(L007152-00-0005) and RNF8 (L-009600-00-0005). Non-targeting control siRNA has been
confirmed to have minimal targeting of known genes (D-001810-10-05).
siRNAs to untranslated regions used were to target sequence:
GGCGAAGUUCACUGCGC (PIAS1), CAGAGGGAGGAGUGACC (PIAS4). Knockdowns
were confirmed by RT-PCR of extracted cell-line RNA and were designed over
more than one exon boundary to encompass the active site (RING).
Purification of 6×His-tagged SUMO conjugates. His-SUMO stable HeLa cells**
were maintained in 0.5 µg ml⁻¹ puromycin and were a gift from R. Hay. Cells
were lysed directly in 8 M urea, 0.1 M Na₂HPO₄/NaH₂PO₄, 0.01 M Tris-HCl, pH
6.3, 10 mM β-mercaptoethanol, 5 mM imidazole plus 0.2% Triton-X-100,
harvested and sonicated. They were then mixed with 50 µl of Ni²⁺ Talon agarose
beads (BD-Bioscience) and incubated overnight at 4 °C, washed and eluted in
SDS-polyacrylamide gel electrophoresis buffer according to ref. 44.
Immunofluorescence microscopy. All cells were grown and prepared as previ- 
ously described’. 

Antibodies. The antibodies used in the study were MS110 ascites (Ab1,
Calbiochem), FK-2 (Biomol), anti-BARD1 antibody (Serotec), anti-Flag (M2)
(Sigma), control rabbit IgG (Sigma), β-actin (Abcam), anti-c-myc (9E10),
anti-SUMO1, anti-SUMO2/3 (Santa Cruz), γ-H2AX clone JBW301 (Millipore) and
polyclonal anti-γ-H2AX (Abcam), anti-RNF8 (Abnova), anti-RNF168 (ref. 10),
anti-RAP80 (Bethyl) and anti-K63-ubiquitin (Millipore).

Time-resolved multiphoton microscopy. FLIM measurements were
undertaken with a modified multiphoton microscopy system similar to that described
previously*. Fluorescence lifetime imaging capability was provided by
time-correlated single-photon counting electronics (Becker & Hickl, SPC 830).
Data were collected through a bandpass filter centred at λ = 510 ± 10 nm
(Chroma). Excitation power was adjusted using a neutral-density filter to give
average photon counting rates of the order 10⁴–10⁵ photons s⁻¹ to avoid pulse
pile up. Acquisition times of the order of 300 s at low excitation power were used
to achieve sufficient photon statistics for fitting, while avoiding either pulse pile
up or observable photo-bleaching. The imaging system was controlled, and the
data later analysed, with custom software written in CVI LabWindows**. FRET
efficiency = 1 − τda/τd, where τda is the pixel-by-pixel fluorescence lifetime of
the donor in the presence of acceptor and τd is the average lifetime of the donor in
the absence of acceptor (in all experiments unlabelled BARD1 was co-expressed).
The Förster radius (distance at which the efficiency of energy transfer is 50%) of
the GFP and RFP pair has been calculated as 4.7 nm (ref. 31). Note that in
analysis of FRET data there are two elements that must be considered: interacting
fluorophore population and FRET efficiency. Bulk measurements of FRET
efficiency cannot distinguish between an increase in FRET efficiency (that is,
proximity) and an increase in FRET population (concentration of interacting species)
because the two parameters are not resolved.
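
As a minimal illustration of the lifetime-based definition above, the following Python sketch computes FRET efficiency from donor lifetimes and, assuming the standard Förster relation E = 1/(1 + (r/R0)^6), converts an efficiency into an apparent donor-acceptor distance. The function names and example lifetimes are hypothetical and are not taken from the custom analysis software used in the study; only the 4.7 nm Förster radius comes from the text.

```python
def fret_efficiency(tau_da, tau_d):
    """FRET efficiency E = 1 - tau_da / tau_d.

    tau_da: donor lifetime in the presence of acceptor (one pixel).
    tau_d:  average donor lifetime in the absence of acceptor.
    Both lifetimes must be in the same unit (for example ns).
    """
    if tau_d <= 0:
        raise ValueError("tau_d must be positive")
    return 1.0 - tau_da / tau_d


def apparent_distance_nm(efficiency, forster_radius_nm=4.7):
    """Invert E = 1 / (1 + (r / R0)**6) for r; defined for 0 < E < 1."""
    if not 0.0 < efficiency < 1.0:
        raise ValueError("efficiency must lie strictly between 0 and 1")
    return forster_radius_nm * ((1.0 - efficiency) / efficiency) ** (1.0 / 6.0)


# Hypothetical per-pixel donor lifetimes (ns) with acceptor present.
pixel_lifetimes_ns = [2.0, 1.8, 1.5]
donor_only_lifetime_ns = 2.0  # hypothetical donor-only average lifetime
efficiencies = [fret_efficiency(t, donor_only_lifetime_ns)
                for t in pixel_lifetimes_ns]
```

Because bulk efficiency conflates proximity with the fraction of interacting molecules, as noted above, any distance obtained this way is only an apparent value.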

Repair assays. HeLa clones with the homologous recombination (DR-GFP) and 
total non-homologous end-joining (EJ5—GFP) reporters stably integrated were 
generated as previously described*’. For repair assays, HeLa-DR-GFP or
HeLa-EJ5-GFP were either mock transfected or transfected with non-targeting or
targeting siRNA. Cells were left for 24 h before transfection with the I-SceI
expression vector pCBA-I-SceI. Three days after pCBA-I-SceI transfection, cells
were fixed and the proportion of GFP-positive cells counted. Counts were
performed in triplicate.

Clonogenic cell survival assays. 293T cells were plated onto 24-well tissue culture
dishes (10⁵ cells per well) and transfected with siRNA and plasmid with
Dharmafect according to the manufacturer’s instructions. After 3 days, they were
exposed to cisplatin (Sigma Chemicals) for 3 h and replated into 10-cm dishes at
various concentrations. After 11 days, cells were fixed with methanol for 10 min
and stained with 0.5% crystal violet (BDH Chemicals). Washed dishes were dried,
and colonies >1 mm were scored. Half-maximum inhibitory concentration
(IC50) values were calculated for each siRNA from the respective sigmoidal
dose-response curves using Prism software. The Bliss independence model is
defined by the equation Exy = Ex + Ey − (ExEy), where Exy is the additive effect
of siRNA 1 and 2 as predicted by their individual effects (Ex and Ey)”°.
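
The Bliss equation above can be sketched in a few lines of Python. This is an illustrative sketch, not the analysis used in the study: effects are assumed to be expressed as fractional inhibition between 0 and 1, and the function names, example values and the tolerance used to call a combination additive are hypothetical.

```python
def bliss_expected(e_x, e_y):
    """Expected combined effect Exy = Ex + Ey - Ex*Ey.

    e_x, e_y: individual effects as fractional inhibition in [0, 1].
    """
    for effect in (e_x, e_y):
        if not 0.0 <= effect <= 1.0:
            raise ValueError("effects must be fractions between 0 and 1")
    return e_x + e_y - e_x * e_y


def compare_to_bliss(observed, e_x, e_y, tolerance=0.05):
    """Label an observed combined effect relative to the Bliss expectation.

    The tolerance is an arbitrary illustrative threshold.
    """
    expected = bliss_expected(e_x, e_y)
    if observed > expected + tolerance:
        return "greater than additive"
    if observed < expected - tolerance:
        return "less than additive"
    return "additive"


# Hypothetical example: two siRNAs giving 30% and 40% inhibition alone.
expected_combined = bliss_expected(0.3, 0.4)
```

With these hypothetical inputs the expected combined inhibition is 0.3 + 0.4 − 0.12 = 0.58, so an observed value near 0.58 would be called additive.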

Protein production, SUMO conjugation and ubiquitin ligase assays. Bacterial 
expression of human BRCA1/BARD1 heterodimer was from bi-cistronic 
expression vector purified using nickel resin as described previously’. 

In vitro SUMO conjugation assays. These were performed as described by 
Boutell et al.***’. Purification of SUMOylated complexes is described in 
Supplementary Fig. 2. 

Immunoprecipitation. 293T cells were lysed in 20 mM Tris-HCl, pH 8, 137 mM
NaCl, 1 mM EGTA, 1% Triton-X-100, 10% glycerol, 1.5MgCl₂, containing
10 mM iodoacetamide and protease inhibitors. After clearance, lysate was
incubated with 20.1 MS110 (Ab1) overnight. Beads were washed in lysis buffer
before elution in SDS–polyacrylamide gel electrophoresis buffer.

A super-Earth transiting a nearby low-mass star


David Charbonneau, Zachory K. Berta, Jonathan Irwin, Christopher J. Burke, Philip Nutzman,
Lars A. Buchhave, Christophe Lovis, Xavier Bonfils, David W. Latham, Stéphane Udry, Ruth A. Murray-Clay,
Matthew J. Holman, Emilio E. Falco, Joshua N. Winn, Didier Queloz, Francesco Pepe, Michel Mayor,
Xavier Delfosse & Thierry Forveille


A decade ago, the detection of the first’? transiting extrasolar 
planet provided a direct constraint on its composition and opened 
the door to spectroscopic investigations of extrasolar planetary 
atmospheres’. Because such characterization studies are feasible 
only for transiting systems that are both nearby and for which the 
planet-to-star radius ratio is relatively large, nearby small stars 
have been surveyed intensively. Doppler studies**® and micro- 
lensing’ have uncovered a population of planets with minimum 
masses of 1.9-10 times the Earth’s mass (M⊕), called super-Earths.
The first constraint on the bulk composition of this novel class of 
planets was afforded by CoRoT-7b (refs 8, 9), but the distance and 
size of its star preclude atmospheric studies in the foreseeable 
future. Here we report observations of the transiting planet 
GJ 1214b, which has a mass of 6.55 M⊕ and a radius 2.68 times
Earth’s radius (R⊕), indicating that it is intermediate in stature
between Earth and the ice giants of the Solar System. We find that 
the planetary mass and radius are consistent with a composition of 
primarily water enshrouded by a hydrogen-helium envelope that 
is only 0.05% of the mass of the planet. The atmosphere is probably 
escaping hydrodynamically, indicating that it has undergone sig- 
nificant evolution during its history. The star is small and only 
13 parsecs away, so the planetary atmosphere is amenable to study 
with current observatories. 

The recently commissioned MEarth Project'®"’ uses an array of 
eight identical 40-cm automated telescopes to photometrically monitor
2,000 nearby M dwarfs with masses between 0.10 and 0.35 solar
masses (M☉) drawn from a sample’ of nearby stars with a large
proper motion. After applying a trend-filtering algorithm’ and a 
three-day running median filter to remove long-term stellar variabil- 
ity, we searched” the light curves for evidence of periodic eclipsing 
signals. The light curve of the star GJ 1214 contained 225 data points, 
of which six values were consistent with having been gathered during 
a time of eclipse and indicating a signal with a period of 1.58 days. On 
the basis of this prediction, we gathered additional photometric 
observations at high cadence using the eight telescopes of the 
MEarth array as well as the adjacent 1.2-m telescope. These light 
curves (shown in Fig. 1) confirm that the star is undergoing flat- 
bottomed eclipses with a depth of 1.3%, indicative of a planetary 
transit. Astrophysical false positives that result from blends of eclips- 
ing binary stars and hinder field transit surveys are not'®"’ a concern 
under the strategy of the MEarth survey. GJ 1214 has a large proper 
motion, and by examining archival images we established that no 
second star lies at the current sky position of GJ 1214, ruling out a 
blend resulting from an eclipsing binary that is not physically assoc- 
iated with the target. The measured parallax and photometry of 
GJ 1214 (Table 1) place stringent constraints on the presence of an
unresolved physically associated binary companion: we find no physically
plausible coeval model that matches both the observed transit
depth and the short duration of ingress and egress. We subsequently 
used the HARPS”® instrument to gather radial velocity observations 
(Fig. 2 and Supplementary Information), which confirmed the plan- 
etary nature of the companion and permitted us to estimate its mass. 
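
As a rough consistency check on the quoted eclipse depth, the depth of a transit across a uniformly bright stellar disk is approximately the square of the planet-to-star radius ratio. The sketch below assumes that simple relation (it ignores limb darkening, which the fitted model in the paper includes) and uses the radius ratio of 0.1162 reported in Table 1; the function name is hypothetical.

```python
def transit_depth(radius_ratio):
    """Approximate fractional transit depth for a uniform stellar disk."""
    if radius_ratio < 0:
        raise ValueError("radius_ratio must be non-negative")
    return radius_ratio ** 2


# Radius ratio Rp/Rs from Table 1.
depth = transit_depth(0.1162)
depth_percent = 100.0 * depth
```

The result, about 1.35%, is in line with the observed depth of 1.3%.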

Figure 1 | Photometric data for GJ 1214. Light curves of GJ 1214 spanning
times of transit for four separate transit events, gathered with the MEarth
Observatory (either a single telescope or eight telescopes, denoted
respectively by MEarth × 1 and MEarth × 8) and the F. L. Whipple
Observatory (FLWO) 1.2-m telescope. All light curves have been binned to a
uniform cadence of 45 s to facilitate a visual comparison. We fitted the
unbinned light curves to a model”? corresponding to a planet in a circular
orbit transiting a limb-darkened star, setting the limb-darkening coefficients
to match the inferred stellar properties as described in the text. This model
has five parameters: the orbital period P, the time of transit centre Tc, the
ratio of the radius of the planet to that of the star Rp/Rs, the ratio of the
semimajor axis to the stellar radius a/Rs, and the orbital inclination i. We use
a Markov chain Monte Carlo method to estimate the uncertainties, and our
results are stated in Table 1. The solid lines show the best-fit model fitted
simultaneously to all the data.

national de la recherche scientifique, Laboratoire d'Astrophysique de Grenoble (LAOG), UMR 5571, 38041 Grenoble Cedex 09, France. °Department of Physics, Kavli Institute for

Table 1 | System parameters for GJ 1214

Parameter                                              Value
Orbital period, P (days)                               1.5803925 ± 0.0000117
Times of centre of transit, Tc (HJD)                   2454964.944208 ± 0.000403
                                                       2454980.7479702 ± 0.0000903
                                                       2454983.9087558 ± 0.0000901
                                                       2454999.712703 ± 0.000126
Planet/star radius ratio, Rp/Rs                        0.1162 ± 0.00067
Scaled semimajor axis, a/Rs                            14.66 ± 0.41
Impact parameter, b                                    0.35476-663
Orbital inclination, i (deg)                           88.62.2652
Radial velocity semi-amplitude, K (m s⁻¹)              12.2 ± 1.6
Systemic velocity, γ (m s⁻¹)                           −21,100 ± 1,000
Orbital eccentricity, e                                <0.27 (95% confidence)
Stellar mass, Ms                                       0.157 ± 0.019 M☉
Stellar radius, Rs                                     0.2110 ± 0.0097 R☉
Stellar density, ρs (kg m⁻³)                           23,900 ± 2,100
Log of stellar surface gravity (CGS units), log gs     4.991 ± 0.029
Stellar projected rotational velocity, v sin i (km s⁻¹) <2.0
Stellar parallax (mas)                                 77.2 ± 5.4
Stellar photometry
  V                                                    15.1 ± 0.6
  I                                                    11.52 ± 0.1
  J                                                    9.750 ± 0.024
  H                                                    9.094 ± 0.024
  K                                                    8.782 ± 0.020
Stellar luminosity, Ls                                 0.00328 ± 0.00045 L☉
Stellar effective temperature, Teff (K)                3,026 ± 130
Planetary radius, Rp                                   2.678 ± 0.13 R⊕
Planetary mass, Mp                                     6.55 ± 0.98 M⊕
Planetary density, ρp (kg m⁻³)                         1870 ± 400
Planetary surface acceleration under gravity, gp (m s⁻²) 8.93 ± 1.3
Planetary equilibrium temperature, Teq (K)
  Assuming a Bond albedo of 0                          555
  Assuming a Bond albedo of 0.75                       393

To convert the photometric and radial velocity parameters into physical parameters for the
system, we require a constraint on the stellar mass. Using the observed parallax distance”° of
12.95 ± 0.9 pc and apparent K-band brightness, we employ an empirical relation’” between
stellar mass and absolute K-band magnitude to estimate the stellar mass. With this value we
find the planetary radius and mass. The uncertainty on the planet mass is the quadrature sum of
the propagated uncertainties on the radial-velocity amplitude and those from the uncertainty in
the stellar mass, which contribute 0.85 M⊕ and 0.50 M⊕ to the error budget, respectively. We
use the observed I − K colour and an empirical relation”® to estimate the bolometric correction
and subsequently the stellar luminosity and stellar effective temperature (assuming the stellar
radius quoted in the table). Using the luminosity, we estimate a planetary equilibrium
temperature, assuming a value for the Bond albedo. HJD, heliocentric Julian date.

Table 1 presents our estimates of the physical quantities for planet
and star. We estimate the planetary equilibrium temperature to be as
great as 555 K (the case for a Bond albedo of 0) and as low as 393 K
(assuming a Bond albedo of 0.75, the same as that for Venus). This
latter value is significantly cooler than all known transiting planets,
and exceeds the condensation point of water by only 20 K. This
consideration is significant, because it demonstrates that for M
dwarfs the discovery of super-Earths within the stellar habitable zones
is within reach of ground-based observatories such as MEarth,
whereas the discovery of such objects orbiting solar analogues is
thought to require space-based platforms such as the Kepler
Mission”.
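
The equilibrium temperatures quoted for GJ 1214b can be approximately reproduced with the textbook expression Teq = Teff · sqrt(Rs / 2a) · (1 − A)^(1/4), which assumes that absorbed stellar energy is re-radiated uniformly over the whole planet. The sketch below is an illustration under that assumption, using Teff and a/Rs from Table 1; the paper itself derives the temperature from the stellar luminosity, so small differences are expected, and the function name is hypothetical.

```python
import math


def equilibrium_temperature(t_eff_k, a_over_rs, bond_albedo):
    """Equilibrium temperature (K) assuming uniform re-radiation.

    t_eff_k:     stellar effective temperature in kelvin.
    a_over_rs:   semimajor axis in units of the stellar radius.
    bond_albedo: fraction of incident energy reflected (0 to 1).
    """
    if not 0.0 <= bond_albedo < 1.0:
        raise ValueError("bond_albedo must be in [0, 1)")
    return (t_eff_k * math.sqrt(1.0 / (2.0 * a_over_rs))
            * (1.0 - bond_albedo) ** 0.25)


# Table 1 values: Teff = 3,026 K and a/Rs = 14.66.
teq_dark = equilibrium_temperature(3026.0, 14.66, 0.0)
teq_venus_like = equilibrium_temperature(3026.0, 14.66, 0.75)
```

These inputs give values within a few kelvin of the quoted 555 K and 393 K, well inside the uncertainty implied by the ±130 K error on the stellar effective temperature.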

Figure 2 | Change in radial velocity of GJ 1214. a, We gathered 21
observations during 2009 July 24 to 2009 August 6, and six observations
during 2009 June 11-19. We estimate’ the change in the radial velocity by
first constructing a stellar template by summing the observations (corrected
to the barycentre), and then minimizing the χ² difference between this
template and each spectrum. We initially restricted our analysis to the
July–August data (shown as filled points, with repetitions shown as open
symbols), out of concern that long-term stellar variability or a second planet
could lead to an offset between these data and those gathered in June (not
shown). We fitted a sinusoidal model (solid curve) constrained by the
photometric period and time of transit (dotted lines) and found a good fit
(χ² = 15.98 for 19 degrees of freedom) with a semi-amplitude of
K = 12.2 ± 1.6 m s⁻¹. We considered an eccentric orbit, and found that the
best-fit model (χ² = 13.02 for 17 degrees of freedom) was not significantly
better and yielded an indistinguishable K. We conclude that there is no
evidence that the orbit is non-circular, and we state the upper limit in Table 1.
We then included the June observations and found K = 12.4 ± 1.8 m s⁻¹,
which is consistent with but noisier than the previous estimate. However, to
obtain a χ² consistent with an acceptable fit, we need to introduce an
additional noise term of 2.7 m s⁻¹, or an offset of −8 m s⁻¹ from the June
data to the July-August data. Our photometry indicates that the stellar
brightness varies by 2% on timescales of several weeks. We conclude that
spot-induced stellar jitter is the most likely explanation. b, Residuals of the
July-August data to the sinusoidal model. The residuals are consistent with
the internal estimates of the uncertainties, shown here as 1σ error bars.

We compare in Fig. 3 the measured mass and radius of GJ 1214b
with that of models’® that predict planetary radii as a function of mass
and assumed composition. We consider a hypothetical'®
water-dominated composition (75% H₂O, 22% Si and 3% Fe) and take this
prediction to be an upper bound on the planet radius, assuming a
solid composition. This model provides a minimum mass for the
gaseous envelope: assuming that the envelope is isothermal (with a
temperature corresponding to a Bond albedo of 0, above) and
composed of hydrogen and helium, and that the observed planetary
transit radius corresponds” to an atmospheric pressure of 1 mbar,
we estimate a scale height of 233 km and a total envelope mass of
0.0032 M⊕ (0.05% of the planetary mass). In this model, the relative
mass of the envelope to the core is much smaller than that for the ice
giants of the Solar System. If we continue under this assumed
composition and consider both the Solar System planets and the
extrasolar worlds together in Fig. 3, the sequence decreasing in mass from
HD 149026b and Saturn to HAT-P-11b, GJ 436b, Neptune and
Uranus, and finally GJ 1214b would then trace an atmospheric
depletion curve: the mass of the gaseous envelope relative to that of the
core would decrease with mass, which is consistent with the fact that
the atmospheres of Earth and Venus are each only a trace component
by mass. We note, however, that with only an estimate of the average
density, we cannot be certain that GJ 1214b, GJ 436b and HAT-P-11b
do not have compositions significantly different from that assumed
above. For example, these planets could contain cores of iron or
silicates enshrouded by much more massive envelopes of hydrogen
and helium, a situation that would challenge models of formation but
is not excluded by the current observations.

Our estimate of the stellar radius is 15% larger than that predicted
by theoretical models for the stellar mass we derived. Such discrepancies
are well established from observations of M-dwarf eclipsing
binaries, and indeed a similar stellar radius enhancement was determined
for the only other M-dwarf with a known transiting planet,
GJ 436. If the true value of the stellar radius is 0.18 R☉ (as predicted by
both the theoretical models and an empirical radius relation for
low-mass stars), then the planet radius would be revised downwards
to 2.27 R⊕, which is consistent with a water-dominated composition
without the need for a gaseous envelope. If the empirical relation for
angular diameter can be extended to this spectral type, this would
provide an alternative estimate of the stellar radius, given a refined
estimate of the parallax.

We considered the timescale for hydrodynamic escape of a hydrogen-dominated
envelope. Assuming that the ultraviolet luminosity
of the star is 10⁻⁵ of its bolometric luminosity (typical for inactive
field M dwarfs), we calculate a hydrodynamical escape rate of
9 × 10⁵ kg s⁻¹; we further verified that at the sonic point the mean
free path is only 4% of the scale height. At this rate, the minimum-mass
envelope described above would be removed in about 700 Myr.
The stellar ultraviolet radiation was probably much larger when the 
star was young, which would result in an even shorter timescale for

Figure 3 | Masses and radii of transiting planets. GJ 1214b is shown as a red
filled circle (the 1σ uncertainties correspond to the size of the symbol), and
the other known transiting planets are shown as open red circles. The eight
planets of the Solar System are shown as black diamonds. GJ 1214b and
CoRoT-7b are the only extrasolar planets with both well-determined masses
and radii for which the values are less than those for the ice giants of the Solar
System. Despite their indistinguishable masses, these two planets probably
have very different compositions. Predicted radii as a function of mass are
shown for assumed compositions of H/He (solid line), pure H₂O (dashed
line), a hypothetical water-dominated world (75% H₂O, 22% Si and 3% Fe
core; dotted line) and Earth-like (67.5% Si mantle and a 32.5% Fe core; dot-dashed
line). The radius of GJ 1214b lies 0.49 ± 0.13 R⊕ above the water-world
curve, indicating that even if the planet is predominantly water in
composition, it probably has a substantial gaseous envelope.


removal of the envelope. An age of 3–10 Gyr for the star is supported
both by its kinematics (which indicate that it is a member
of the old disk) and the lack of chromospheric activity from the
absence of Hα line emission. Moreover, the dominant periodicity
in the MEarth photometry is 83 days. Stars spin down as they age,
and a very long rotation period would also indicate an old star. Thus we
conclude that significant loss of atmospheric mass has occurred over 
the lifetime of the planet; the current envelope is therefore probably 
not primordial. Moreover, some (or all) of the present envelope may 
have resulted from outgassing and further photodissociation of 
material from the core. If the composition of the gaseous envelope 
is indeed dominated by hydrogen (whether primordial or not), the 
annulus of the transmissive portion of planetary atmosphere would 
occult roughly 0.16% of the stellar disk during transit and thus present
a signal larger than that already studied for other exoplanets.
Thus GJ 1214b presents an opportunity to study a non-primordial
atmosphere enshrouding a world orbiting another star. Such studies
have been awaited and would serve to confirm directly that the
atmosphere was predominantly hydrogen, because only then would 
the scale height be large enough to present a measurable wavelength- 
dependent signal in transit. 

The discussion above assumes that the solid core of GJ 1214b is 
predominantly water. This is at odds with the recently discovered
CoRoT-7b, the only other known transiting super-Earth. CoRoT-7b
has a mass of 4.8 M⊕, a radius of 1.7 R⊕ and a density of 5,600 kg m⁻³,
indicating a composition that is predominantly rock. The very different
radii of GJ 1214b and CoRoT-7b despite their indistinguishable
masses may be related to the differing degrees to which the two
planets are irradiated by their parent stars: owing to the much greater 
luminosity of its central star, CoRoT-7b has an equilibrium temper- 
ature of about 2,000 K, roughly fourfold that of GJ 1214b. It may be 
that both planets have rocky cores of similar mass and that it is only 
for CoRoT-7b that the gaseous envelope has been removed, yielding 
the smaller observed radius. Alternatively, GJ 1214b may have a 
water-dominated core, indicating a very different formation history 
from that of CoRoT-7b. Such degeneracies in the models of the
physical structures of super-Earths will be commonplace when only
a radius and mass are available, but at least one method has been
proposed to mitigate this problem in part. The differences in composition
between GJ 1214b and CoRoT-7b bear on the quest for
habitable worlds: numerous planets with masses indistinguishable 
from those of GJ 1214b and CoRoT-7b have been uncovered indir- 
ectly by radial velocity studies, and some of these lie in or near their 
stellar habitable zones. If such cooler super-Earth planets do indeed 
have gaseous envelopes similar to that of GJ 1214b, the extreme 
atmospheric pressure and absence of stellar radiation at the surface 
might render them inhospitable to life as we know it on Earth. This 
would motivate the push to even more sensitive ground-based techniques
capable of detecting planets with sizes and masses equal to those
of the Earth orbiting within the habitable zones of low-mass stars.

A single sub-kilometre Kuiper belt object from a
stellar occultation in archival data


H. E. Schlichting, E. O. Ofek, M. Wenz, R. Sari, A. Gal-Yam, M. Livio, E. Nelan & S. Zucker


The Kuiper belt is a remnant of the primordial Solar System. 
Measurements of its size distribution constrain its accretion and 
collisional history, and the importance of material strength of
Kuiper belt objects. Small, sub-kilometre-sized, Kuiper belt
objects elude direct detection, but the signature of their occultations
of background stars should be detectable. Observations at both
optical and X-ray wavelengths claim to have detected such occultations,
but their implied abundances are inconsistent with each
other and far exceed theoretical expectations. Here we report an 
analysis of archival data that reveals an occultation by a body with 
an approximately 500-metre radius at a distance of 45 astronomical 
units. The probability of this event arising from random statistical 
fluctuations within our data set is about two per cent. Our survey 
yields a surface density of Kuiper belt objects with radii exceeding 
250 metres of $2.1^{+4.8}_{-1.7} \times 10^{7}\ \mathrm{deg}^{-2}$, ruling out inferred surface
densities from previous claimed detections by more than 5σ. The
detection of only one event reveals a deficit of sub-kilometre-sized 
Kuiper belt objects compared to a population extrapolated from 
objects with radii exceeding 50 kilometres. This implies that sub- 
kilometre-sized objects are undergoing collisional erosion, just like 
debris disks observed around other stars. 

A small Kuiper belt object (KBO) crossing the line of sight to a star 
will partially obscure the stellar light, an event which can be detected in 
the star’s light curve. For visible light, the characteristic scale of diffraction
effects, known as the Fresnel scale, is given by $(\lambda a/2)^{1/2} \approx 1.3$ km,
where $a \sim 40$ astronomical units (AU) is the distance to the Kuiper belt
and $\lambda \sim 600$ nm is the wavelength of our observations.

Diffraction effects will be apparent in the star’s light curve as a 
result of occulting KBOs provided that both the star and the occulting
object are smaller than the Fresnel scale. Occultations by
objects smaller than the Fresnel scale are in the Fraunhofer regime. 
In this regime the diffraction pattern is determined by the size of the 
KBO and its distance to the observer, the angular size of the star, the 
wavelength range of the observations and the impact parameter 
between the star and the KBO (see Supplementary Information for 
details). The duration of the occultation is approximately given by 
the ratio of the Fresnel scale to the relative velocity perpendicular to 
the line of sight between the observer and the KBO. Because the 
relative velocity is usually dominated by the Earth’s velocity around 
the Sun, which is 30 km s⁻¹, typical occultations last only a short time,
of the order of a tenth of a second.
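
The two estimates above can be checked numerically. This is a minimal sketch using only the values quoted in the text (600 nm, 40 AU, 30 km s⁻¹); the function names are illustrative, and the metre value of the astronomical unit is the standard IAU definition rather than something stated in the paper.

```python
import math

AU_M = 1.495978707e11  # metres per astronomical unit (IAU definition)


def fresnel_scale_m(wavelength_m, distance_m):
    """Fresnel scale sqrt(lambda * a / 2) in metres."""
    return math.sqrt(wavelength_m * distance_m / 2.0)


def crossing_time_s(scale_m, relative_speed_m_s):
    """Time for the line of sight to sweep across one Fresnel scale."""
    return scale_m / relative_speed_m_s


scale = fresnel_scale_m(600e-9, 40 * AU_M)
duration = crossing_time_s(scale, 30e3)
print(f"Fresnel scale ~ {scale / 1e3:.2f} km, crossing time ~ {duration:.3f} s")
```

Crossing a single Fresnel scale takes a few hundredths of a second. The diffraction pattern extends over more than one Fresnel scale, so an event lasting of the order of a tenth of a second is consistent with this estimate.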

Extensive ground-based efforts have been conducted to look for
optical occultations. So far, these visible searches have
announced no detections in the region of the Kuiper belt (30–60 AU),
but one of these quests claims to have detected some events beyond
100 AU and at about 15 AU (ref. 10). Unfortunately, ground-based surveys
may suffer from a high rate of false-positives owing to atmospheric


scintillation, and lack the stability of space-based platforms. The 
ground-breaking idea to search for occultations in archival RXTE 
X-ray data resulted in several claimed occultation events. Later,
revised analysis of the X-ray data concluded that the majority of
the originally reported events are most probably due to instrumental
dead-time effects. Thus, previous reports of optical and X-ray events
remain dubious and their inferred KBO abundance is inconsistent
with the observed break in the KBO size distribution, which was
obtained from direct detections of large KBOs. Furthermore, they
are also difficult to reconcile with theoretical expectations, which predict
collisional evolution for KBOs smaller than a few kilometres in
size and hence a lower KBO abundance than inferred from extrapolation
from KBOs with radii r > 50 km.

For the past 14 years, the Fine Guidance Sensors (FGS) on board 
the Hubble Space Telescope (HST) have been collecting photometric 
measurements of stars with 40-Hz time resolution, allowing for the 
detection of the occultation diffraction pattern rather than a simple 
decrease in the photon count. We examined four and a half years of 
archival FGS data, which contain about 12,000 star hours of low
ecliptic latitude (|b| < 20°) observations.

Our survey is most likely to detect occultations by KBOs that are 
200-500 m in radius given the signal-to-noise of our data (Sup- 
plementary Fig. 1) and a power-law size distribution with power-law 
index between 3 and 4.5. Occultation events in this size range are in the 
Fraunhofer regime, where the diffraction pattern is independent of the 
occulting object’s shape and the depth of the diffraction pattern varies 
linearly with the area of the object. The theoretical light curves for our 
search algorithm were therefore calculated in this regime. We fitted 
these theoretical occultation templates to the FGS data and performed 
χ² analysis to identify occultation candidates (see Supplementary
Information). We detected one occultation candidate, at ecliptic latitude
14°, that significantly exceeds our detection criterion (Fig. 1). The
best-fit parameters yield a KBO size of r = 520 ± 60 m and a distance
of $45^{+5}_{-10}$ AU, where we assumed a circular KBO orbit and an inclination
of 14°. Using bootstrap simulations, we estimate a probability of ~2%
that such an event is caused by statistical fluctuations over the whole 
analysed FGS data set (Supplementary Fig. 5). We note that for objects 
on circular orbits around the Sun two solutions can fit the duration of 
the event. However, the other solution is at a distance of 0.07 AU from 
the Earth, and is therefore unlikely. It is also unlikely that the occulting 
object was located in the asteroid belt, because the expected occultation 
rate from asteroids is about two orders of magnitude less than our 
implied rate. Furthermore, an asteroid would have to have an eccent- 
ricity of order unity to be able to explain the duration of the observed 
occultation event. 

Using the KBO ecliptic latitude distribution from ref. 24, our detection
efficiency, and our single detection, we constrain the surface

Figure 1 | Photon counts as a function of time of the candidate occultation
event observed by FGS2. a, The photon count spanning ±2 s around the
occultation event. b, The event in detail. The red crosses are the FGS data 
points with Poisson error bars, the dashed blue line is the theoretical 
diffraction pattern (calculated for the 400-700 nm wavelength range of the 
FGS observations), and the pink squares correspond to the theoretical light 
curve integrated over 40-Hz intervals. We note that the actual noise for this 
observation is about 4% larger than Poisson noise owing to additional noise 
sources such as dark counts (about 3-6 counts in a 40-Hz interval), and jitter 
caused by the displacement of the guide star (by up to 10 mas) from its null 
position. The mean signal-to-noise ratio in a 40-Hz interval for the roughly 
half an hour of observations is ~12. The event occurred at 05:17:49
Coordinated Universal Time (UTC) on 24 March 2007. The best-fit χ² is 20.1 with
21 degrees of freedom. The star has an ecliptic latitude of +14°. Its angular
radius and effective temperature are about 0.3 of the Fresnel scale and about
4,460 K, respectively. These values were derived by fitting the 2MASS JHK
and USNO-B1 BR photometry with a black-body spectrum. The position
of the star is RA = 186.87872°, dec. = 12.72469° (J2000) and its estimated 
V-magnitude is 13.4. The auto-correlation function (excluding lag zero) of 
the photometric time series of this event is consistent with zero within the 
statistical uncertainty. Each FGS provides two independent photomultiplier 
(PMT) readings and we confirmed that the occultation signature is present 
in both of these independent photon counts. We examined the photon 
counts of the other guide star that was observed by FGS1 at the time of the 
occultation and confirmed that the occultation signal is only present in the 
observations recorded by FGS2. We examined the engineering telemetry for 
HST around the time of the event and verified that the guiding performance 
of HST was normal. We therefore conclude that the above occultation 
pattern is not caused by any known instrumental artefacts. 


density around the ecliptic (averaged over −5° < b < 5°) of KBOs
with radii larger than 250 m to $2.1^{+4.8}_{-1.7} \times 10^{7}\ \mathrm{deg}^{-2}$ (see
Supplementary Information Sections 5 and 6). This surface density
is about three times the implied surface density at 5.5° ecliptic latitude
and about five times the surface density at 8–20° ecliptic latitude. This
is the first measurement of the surface density of hectometre-sized
KBOs and it improves previous upper limits by more than an order of
magnitude.

Figure 2 displays our measurement for the sub-kilometre KBO sur- 
face density and summarizes published upper limits from various 
surveys. Our original data analysis focused on the detection of KBOs
located at the distance of the Kuiper belt between 30 AU and 60 AU. To
compare our results with previously reported ground-based detections

Figure 2 | Cumulative KBO size distribution as a function of KBO radius for
objects located between 30 and 60 AU. The results from our FGS survey are
shown in red and are presented in three different ways. (1) The red cross is
derived from our detection and represents the KBO surface density around
the ecliptic (averaged over −5° < b < 5°) and is shown with 1σ error bars.
The cross is plotted at r = 250 m, which is roughly the peak of our detection 
probability (see Supplementary Information Section 6 for details). (2) The 
upper and lower red curves correspond to our upper and lower 95% 
confidence levels which were derived without assuming any size 
distribution. (3) The region bounded by the two straight red lines falls within
1σ of our best estimate for the power-law size distribution index, that is,
q = 3.9 ± 0.3, which was calculated for low ecliptic latitudes (|b| < 5°).
These lines are anchored to the observed surface density at r = 45 km. For
comparison, we also show three other lines (green, blue, turquoise). The 
green (long-dashed) line is the observed size distribution of large KBOs (that 
is, r > 45 km), which has q = 4.5, extrapolated as a single power-law to small 
sizes. The blue (short-dashed) line is a double power-law with q = 3.5 
(collisional cascade of strength-dominated bodies) for KBOs with radii less 
than 45 km and q = 4.5 above. The turquoise (dot-dashed) line corresponds 
to q = 3.0 (collisional cascade of strengthless rubble piles) for KBOs below 
45 km in size. All distributions are normalized to N(>r) = 5.4 deg⁻² at a
radius of 45 km (ref. 25). In addition, 95% upper limits from various surveys 
are shown in black (refs 17, 18, 16, 9, 15, 14 and 28). We note that a power- 
law index of 3.9 was used for calculating the cumulative KBO number 
density from the RXTE observations. 


beyond 100 AU (ref. 10), we performed a second search of the FGS data
that was sensitive to objects located beyond the classical Kuiper belt.
Our results challenge the reported ground-based detections of two
300-m-sized objects beyond 100 AU (ref. 10). Given our total number
of star hours and a detection efficiency of 3% for 300-m-sized objects at
~100 AU, we should have detected more than twenty occultations. We
therefore rule out the previously claimed optical detections (ref. 10) by more
than 5σ. This result accounts for the broad latitude distribution of our
observations (that is, |b| < 20°) and the quoted detection efficiency of
our survey includes the effect of the finite angular radii of the guide
stars at 100 AU.
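
One plausible reading of the significance quoted above is a simple Poisson argument. The sketch below is an assumption-laden illustration, not the authors' calculation: it takes exactly 20 expected events (the text says "more than twenty"), assumes that the second search found no such events (implied but not stated explicitly), and converts the resulting probability to a one-sided Gaussian significance. Function names are hypothetical.

```python
import math
from statistics import NormalDist


def poisson_zero_probability(expected_events):
    """Probability of observing zero events when `expected_events` are expected."""
    return math.exp(-expected_events)


def one_sided_sigma(p_value):
    """Gaussian significance whose one-sided tail probability equals p_value."""
    return -NormalDist().inv_cdf(p_value)


p_none = poisson_zero_probability(20.0)
sigma = one_sided_sigma(p_none)
print(f"P(no events | 20 expected) = {p_none:.1e}, about {sigma:.1f} sigma")
```

With these inputs the significance exceeds 5σ, consistent with the statement in the text. A larger expected number of events would only strengthen the exclusion.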

The KBO cumulative size distribution is parameterized by
$N(>r) \propto r^{1-q}$, where $N(>r)$ is the number of objects with radii
greater than $r$, and $q$ is the power-law index. The power-law index
for KBOs with radii above ~45 km is ~4.5 (refs 21, 22) and there is
evidence for a break in the size distribution at about $r_{\mathrm{break}} \sim 45$ km
(refs 20–22). Hence we use this break radius and assume a surface
density for KBOs larger than $r_{\mathrm{break}}$ (ref. 25) of 5.4 deg⁻² around the
ecliptic. Accounting for our detection efficiency, the velocity distribution
of the HST observations, and assuming a single power-law for
objects with radii less than 45 km in size, we find $q = 3.9^{+0.3,+0.4}_{-0.3,-0.7}$ (1σ
and 2σ errors) below the break. Our results firmly show a deficit of sub-
kilometre-sized KBOs compared to large objects. This confirms the 
existence of the previously reported break and establishes a shallower 
size distribution extending two orders of magnitude in size down to 
sub-kilometre-sized objects. This suggests that sub-kilometre-sized 
KBOs underwent collisional evolution, eroding the smaller KBOs.
This collisional grinding in the Kuiper belt provides the missing link
between large KBOs and dust, producing debris disks around other
stars. Currently, our results are consistent with a power-law index of
strength-dominated collisional cascade, q = 3.5, within 1.3σ and
with predictions for strengthless rubble piles, q = 3.0, within 2.4σ.
An intermediate value of 3 < q < 3.5 implies that KBOs are strengthless
rubble piles above some critical size, $r_c < r < 45$ km, and strength-dominated
below it, $r < r_c$. Our observations constrain $r_c$ for the first
time to our knowledge. At the 2σ level we find $r_c > 3$ km.
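
The size of the deficit can be made concrete with the cumulative power law described above. This is a minimal sketch assuming a single power law below the break, normalized to 5.4 deg⁻² at 45 km as in the text; the function name and the particular set of indices compared are illustrative choices.

```python
def cumulative_density(radius_km, q, break_radius_km=45.0, density_at_break=5.4):
    """N(>r) in deg^-2 for a single power law N(>r) proportional to r^(1-q)."""
    return density_at_break * (radius_km / break_radius_km) ** (1.0 - q)


# Compare the unbroken extrapolation with shallower indices at r = 250 m.
indices = {
    "unbroken extrapolation": 4.5,
    "best-fit index": 3.9,
    "strength-dominated cascade": 3.5,
    "strengthless rubble piles": 3.0,
}
for label, q in indices.items():
    density = cumulative_density(0.25, q)
    print(f"q = {q:.1f} ({label}): N(>250 m) ~ {density:.1e} per square degree")

deficit = cumulative_density(0.25, 4.5) / cumulative_density(0.25, 3.9)
print(f"q = 4.5 predicts about {deficit:.0f} times more 250-m objects than q = 3.9")
```

For q = 3.9 the extrapolated density at 250 m is of order 10⁷ per square degree, comparable to the measured value, whereas extending the q = 4.5 slope of the large objects would give a density more than an order of magnitude higher.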

Using our estimate for the size distribution power-law index
(q = 3.9) and our KBO surface density for 250-m-sized KBOs at an
ecliptic latitude of b = 5.5°, which is the ecliptic latitude of the RXTE
observations of Scorpius X-1, we predict that there should be about
3.6 × 10⁹ objects of radius 30 m per square degree. This is about 150
times less than the original estimate from X-ray observations of
Scorpius X-1 that reported 58 events, and it is about 30 times less than
the revised estimate from the same X-ray observations, which concludes
that up to 12 events might be actual KBO occultations. Our results rule
out the implied surface density from these 12 events at 7σ confidence
level. One can reconcile our results and the claimed X-ray detections
only by invoking a power-law index of q ~ 5.5 between 250 m and 30 m.
More recent X-ray work reports no new detections in the region of the
Kuiper belt but places an upper limit of 1.7 × 10¹¹ deg⁻² for objects of
50 m in radius and larger. This is consistent with the KBO surface
density of N(>50 m) = 8.2 × 10⁸ deg⁻² that we derive by extrapolating
from our detection in the hectometre size range.
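
The arithmetic behind the reconciling index can be sketched as follows. The code assumes the cumulative power law with index q = 3.9 used in the text and asks what steeper index between 250 m and 30 m would supply the roughly 30 times more objects implied by the revised X-ray estimate. The function names are hypothetical, and the calculation is an illustration rather than the authors' own procedure.

```python
import math


def extrapolate(density, r_from_m, r_to_m, q):
    """Scale a cumulative density from r_from to r_to with N(>r) proportional to r^(1-q)."""
    return density * (r_to_m / r_from_m) ** (1.0 - q)


def index_needed(boost, r_from_m, r_to_m, q):
    """Index that gives `boost` times more objects at r_to than index q does."""
    return q + math.log(boost) / math.log(r_from_m / r_to_m)


# Ratio of the extrapolated densities at 30 m and 50 m for q = 3.9.
ratio_30_to_50 = extrapolate(1.0, 50.0, 30.0, 3.9)

# Index required between 250 m and 30 m to gain a factor of 30 at 30 m.
q_xray = index_needed(30.0, 250.0, 30.0, 3.9)

print(f"N(>30 m) / N(>50 m) ~ {ratio_30_to_50:.1f}")
print(f"index needed to match the revised X-ray estimate ~ {q_xray:.1f}")
```

The ratio of about 4.4 matches the ratio of the two extrapolated densities quoted in the text, and the required index comes out close to the q ~ 5.5 stated there.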

The statistical confidence level on our detection is 98%. However, 
our conclusions that there is a significant break in the size distri- 
bution and that collisional erosion is taking place and the significant 
discrepancy with previously claimed occultation detections rely on 
the low number of events we discovered. These conclusions would 
only be strengthened if this event was caused by an unlikely statistical 
fluctuation or an as-yet-unknown instrumental artefact. 

Ongoing analysis of the remaining FGS data, which will triple the 
number of star hours, together with further development of our 
detection algorithm (that is, including a larger number of light-curve 
templates) holds the promise of additional detections of occultation 
events and will allow us to constrain the power-law index of the size 
distribution further.

Photon-by-photon feedback control of a single-atom trajectory

A. Kubanek, M. Koch, C. Sames, A. Ourjoumtsev, P. W. H. Pinkse, K. Murr & G. Rempe


Feedback is one of the most powerful techniques for the control of 
classical systems. An extension into the quantum domain is desirable 
as it could allow the production of non-trivial quantum states and
protection against decoherence. The difficulties associated with
quantum, as opposed to classical, feedback arise from the quantum 
measurement process—in particular the quantum projection noise 
and the limited measurement rate—as well as from quantum fluc- 
tuations perturbing the evolution in a driven open system. Here we 
demonstrate real-time feedback control of the motion of a single
atom trapped in an optical cavity. Individual probe photons carrying
information about the atomic position activate a dipole laser that
steers the atom on timescales 70 times shorter than the atom’s oscil- 
lation period in the trap. Depending on the specific implementation, 
the trapping time is increased by a factor of more than four owing to 
feedback cooling, which can remove almost all the kinetic energy of 
the atom in a quarter of an oscillation period. Our results show that
the detected photon flux reflects the atomic motion, and thus mark a
step towards the exploration of the quantum trajectory of a single
atom at the standard quantum limit. 

In contrast to highly energetic charged particles, whose trajectories 
can be observed using ionization detectors, single neutral atoms are 
much harder to track. The reason is that the interaction of neutral 
particles with a detector is much weaker than that of charged particles. 
So far, the most efficient means of detecting single atoms is light 
scattering. However, the scattering rate is limited by the natural decay 
rate of the atom’s excited state and the photons are emitted in random 
directions. Therefore, the signal becomes vanishingly small if rapid 
measurements must be performed, for example in the implementa- 
tion of fast feedback on single atoms perturbed by quickly changing 
random forces. 

In this context, optical cavity quantum electrodynamics provides a 
powerful technique for single-atom tracking and feedback owing to its 
unique observation and control capabilities, respectively.
Measurements are faster and more sensitive than in free space, as a 
result of the increased rate of information exchange between the atom 
and the observed cavity field. This makes it possible to estimate the 
atomic trajectory quickly and use this position information to steer the 
atom rapidly in the desired direction. An advantage of such a strategy is
that the steering force is automatically synchronized with the atomic 
motion. This is useful if the atom is moving in an anharmonic poten- 
tial (as here) where the oscillation period depends on the oscillation 
amplitude, in which case an actuator operated at a fixed frequency is 
insufficient. More importantly, it makes it possible to control the 
atomic motion even if this motion is unpredictable on timescales as 
short as the oscillation period in the trap. 

Unlike in previous work, the actuator uses blue-detuned dipole
light that pushes the atom towards the area of low light intensity in
the centre of the cavity. This has three benefits, which we found to
be essential. First, the dipole laser induces hardly any shift in the 


energy levels of the atom, so tracking and steering are largely inde- 
pendent of each other. Second, the dipole laser controls the motion 
perpendicular to the cavity axis, which is the typical escape direction 
for a trapped atom. Third, it leaves the atomic motion along the 
cavity axis unperturbed and does not interfere with cavity cooling 
along this direction. As a result, we are able to study the deterministic,
as well as the probabilistic, nature of the atomic trajectory by tuning
the observation time interval and measuring the system response.
The system is sketched in Fig. 1. A high-finesse cavity supports a
TEM₀₀ mode (waist, ~29 µm) nearly resonant with the transition


[Figure 1 graphic: trapping potential U (mK) versus radial distance r (µm) for low and high toroidal-trap power; photon clicks.]

Figure 1 | Experimental setup including the feedback loop. An optical cavity
directs the transmitted light to two single-photon counting modules
(SPCM1, SPCM2). A real-time processor determines the sums of the photon
clicks of both detectors in two consecutive time windows, n₁ and n₂, of equal
duration, T. To reduce the kinetic energy of the atom, the algorithm switches
the toroidal blue-detuned dipole trap to high power if the atom attempts to
leave the trap (here for n₁ ≤ n₂) and to a low power if the atom returns
towards the cavity axis (n₁ > n₂). The power of the toroidal trap is switched
using an acousto-optical modulator (AOM), superimposed with the probe
light using a dichroic mirror (DM) and detected using a photomultiplier
(PM). The resulting trapping potential, U, is plotted as a function of the
radial distance from the cavity centre, r, for low power (red box) and high
power (blue box).

from the 5²S₁/₂ F = 3, m_F = 3 state to the 5²P₃/₂ F = 4, m_F = 4 state
in Rb atoms (wavelength, 780 nm), which results in a maximum
atom-cavity coupling of g₀/2π = 16 MHz, exceeding losses due to
atomic polarization decay (rate, γ/2π = 3 MHz) and cavity field
decay (rate, κ/2π = 1.25 MHz). A weak probe laser excites this mode
at the input mirror. In addition, a red-detuned dipole-trap laser of
wavelength 785 nm (not drawn) resonantly excites a second TEM₀₀
mode, confining the atom in the cavity (trap depth, ~1 mK). A
separate, blue-detuned, dipole trap is implemented at 775 nm and
consists of a TEM₁₀ and a TEM₀₁ mode, each of which is blue-detuned
with respect to the probe light by two free spectral ranges.
Together they form a toroidal repulsive trap. Because in a cylindrically
symmetric system circular orbits do not modulate the transmitted
light, and because cylindrically symmetric forces cannot change the
angular momentum of the atomic trajectory, we break the cylindrical
symmetry by making the TEM₀₁ component 50% stronger than the
TEM₁₀ component. The light exiting through the output mirror is
separated into probe light and trap light, and the probe light is split
by a non-polarizing beam splitter and detected using two single-photon
counting modules. The probe laser is almost resonant with the empty
cavity (detuned by 2π × 100 kHz) and is detuned from the Stark-shifted
atomic resonance by 2π × 20 MHz. For such parameter choices,
a well-coupled atom will induce a drop in the transmission from
1 photon per microsecond (for an empty cavity) to typical values as
low as 0.03 photons per microsecond.

Our digital feedback algorithm uses the blue toroidal trap as an 
‘actuator’. The input signal is sensitive to the atomic trajectory in real 
time. An increase in the transmission indicates a lower coupling to the 
mode, as happens when an atom leaves the cavity. This information is 
extracted by the feedback processor (ADwin-Pro II system), which 
compares the respective numbers of photon clicks, n₁ and n₂, registered
during two consecutive user-defined intervals of equal duration, T, the 
exposure time. Differential feedback routines are used to switch the 
torus potential whenever a turning point of the atomic trajectory in the 
radial direction is registered. To achieve a high efficiency, we apply a 
‘bang-bang’ strategy, in which the intensity of the torus potential is
switched, using an acousto-optical modulator, between two extreme 
powers: 50 nW (low), resulting in an overall trap depth of 1 mK, and 
800 nW (high), corresponding to 2.5 mK. 

We implement two feedback strategies, which work as follows. In 
the ‘normal’ feedback strategy, we decrease the kinetic energy of the 
atom and keep it in the cavity centre. This is done by switching the 
toroidal trap to high power as soon as the atom attempts to leave the 
cavity, and switching it to low power as soon as the atom moves 
towards the cavity axis; see Fig. 1 and Fig. 2a. In the ‘inverted’ feedback
strategy, the switching protocol is reversed, increasing the kinetic 
energy of the atom and expelling it from the cavity. 
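
The switching rule behind the two strategies can be sketched in a few lines. This is a simplified illustration, not the logic running on the authors' feedback processor: the names `TrapPower`, `count_clicks`, `feedback_decision` and `decide_at` are hypothetical, the click times are made up, and the handling of ties for the inverted strategy (a straight reversal of the normal rule) is an assumption. For the normal strategy, ties are sent to high power, following the tie rule the authors report.

```python
from enum import Enum


class TrapPower(Enum):
    LOW = "low"    # 50 nW in the text
    HIGH = "high"  # 800 nW in the text


def count_clicks(click_times_us, start_us, stop_us):
    """Count photon clicks with start_us <= t < stop_us."""
    return sum(1 for t in click_times_us if start_us <= t < stop_us)


def feedback_decision(n1, n2, inverted=False):
    """Choose the torus power from clicks in the earlier (n1) and later (n2) window."""
    # Normal strategy: a photon flux that is not decreasing (n1 <= n2) is read
    # as the atom leaving, so the repulsive torus is switched to high power.
    # Assumption: the inverted strategy swaps the two outputs, ties included.
    leaving = n1 <= n2
    if inverted:
        leaving = not leaving
    return TrapPower.HIGH if leaving else TrapPower.LOW


def decide_at(click_times_us, now_us, exposure_us, inverted=False):
    """Apply the rule to the two consecutive windows of length T ending at now_us."""
    n1 = count_clicks(click_times_us, now_us - 2 * exposure_us, now_us - exposure_us)
    n2 = count_clicks(click_times_us, now_us - exposure_us, now_us)
    return feedback_decision(n1, n2, inverted)


clicks = [1.0, 2.5, 4.0, 12.0]  # made-up click times in microseconds
print(decide_at(clicks, now_us=20.0, exposure_us=10.0))
print(decide_at(clicks, now_us=20.0, exposure_us=10.0, inverted=True))
```

In this example three clicks fall in the earlier window and one in the later window, so the flux is decreasing and the normal strategy selects low power, while the inverted strategy selects high power.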

To be able to follow these strategies as closely as possible, we
designed fast digital feedback logic that can react to the detection of
a single photon with a decision-making time of 1.7 µs and a maximum
switching-process delay of 3 µs. This is much faster than the radial
oscillation, which has a period of ~360 µs, so feedback occurs in real
time. We note that the transmission signal is modulated at half
the oscillation period, T_osc, of the atom, owing to the symmetry of
the cavity modes. For exposure times T ~ T_osc, the signal becomes
averaged, increasing the signal-to-noise ratio but delaying the algorithm.
For T ≪ T_osc, the algorithm is fast but only partial information
about the atomic trajectory is acquired. We assume that the atom
leaves the cavity radially for an increasing photon flux (n₁ < n₂) and
that the atom returns towards the cavity axis for a decreasing photon
flux (n₁ > n₂). Both feedback strategies are shown in Fig. 2b for a short
exposure time, T = 10.2 µs. Here the n₁ and n₂ values are mostly zero.
Nevertheless, the normal feedback strategy keeps the photon rate low,
indicating good confinement of the atom. In contrast, the inverted
feedback strategy leads to an increasing photon flux. Figure 2c shows
an example for a longer exposure time, T = 85 µs. Here the integrated
signal becomes a smooth function, resulting in smoother switching. 

Figure 2 | Feedback protocol for a single-atom trajectory. a, The idealized 
transmission (blue) and feedback reaction (red) are displayed for an atom 
oscillating in the trap. For normal feedback (left column), the toroidal trap is 
switched to low power for decreasing cavity transmission and to high power 
otherwise. For inverted feedback (right column), the switching behaviour is 
reversed. a.u., arbitrary units. b, The two strategies are shown for a short 
exposure time, T = 10.2 µs. Single photon clicks are indicated in black and 
their integration over the exposure time is indicated in blue. Normal 
feedback keeps the atom in a well-coupled regime, reducing the oscillation 
amplitude, whereas inverted feedback is out of phase with the atom’s 
oscillation and therefore increases the oscillation amplitude to the point at 
which the atom leaves the trap. c, For a long exposure time, T = 85 µs, 
inverted feedback is in phase, reducing the oscillation, whereas normal 
feedback is not. Data for single photon clicks is shown multiplied by a factor 
of five, for clarity. 


Concurrently, a large exposure time also delays the feedback, which 
can be seen by comparing the raw photon clicks (black data) with the 
integrated signal (blue data). For this exposure time, normal feedback 
is out of phase with the radial oscillation and therefore drives the atom 
out of the cavity. As inverted feedback is also delayed by one-quarter of 
an oscillation period, it turns into a damping force. 

Special attention must be paid to the case in which n₁ = n₂. We 
found that the feedback is effective only if we switch the toroidal trap 
to low power for n₁ > n₂ and to high power for n₁ ≤ n₂. A possible 
explanation is that for short exposure times, T, the zero-photon 
events are the most likely for atoms near the cavity axis. For the 
chosen, near-resonant, probing of the system, the photon flux 
depends only marginally on the exact atomic position when the 
coupling strength is large. Hence, if n₁ = n₂ = 0 the atom could 
already be moving away from the axis, in which case it is prudent 
to switch the toroidal trap to high power. 
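
The switching rule described above can be sketched in a few lines. This is only an illustration, not the authors' digital logic: the function names, the string labels for the trap power, and the choice to treat n₁ as the earlier and n₂ as the later of two consecutive exposure windows are assumptions, as is the full reversal of the rule (including the tie case) for inverted feedback.

```python
from typing import List, Sequence

LOW, HIGH = "low", "high"  # hypothetical labels for the two trap powers


def count_clicks(click_times_us: Sequence[float], start_us: float,
                 exposure_us: float) -> int:
    """Count photon clicks in the half-open window [start, start + exposure)."""
    end_us = start_us + exposure_us
    return sum(1 for t in click_times_us if start_us <= t < end_us)


def trap_power(n1: int, n2: int, inverted: bool = False) -> str:
    """Decide the trap power from two consecutive photon counts.

    n1 is the count in the earlier window, n2 in the later one (assumption).
    Normal feedback: low power for n1 > n2, high power for n1 <= n2,
    so the tie n1 == n2 (typically 0 == 0) is treated as 'atom may be leaving'.
    Inverted feedback: the same decision with the two powers swapped.
    """
    normal = LOW if n1 > n2 else HIGH
    if not inverted:
        return normal
    return HIGH if normal == LOW else LOW


def feedback_sequence(click_times_us: Sequence[float], exposure_us: float,
                      n_windows: int, inverted: bool = False) -> List[str]:
    """Bin clicks into consecutive windows and decide after each new window."""
    counts = [count_clicks(click_times_us, k * exposure_us, exposure_us)
              for k in range(n_windows)]
    return [trap_power(counts[k - 1], counts[k], inverted)
            for k in range(1, n_windows)]
```

Treating the tie as a separate branch of `trap_power` keeps the special case n₁ = n₂ explicit, which is the case the text singles out as decisive for short exposure times.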

To analyse the performance of our feedback strategies quantita- 
tively, we study the atomic storage time, which is mainly determined 
by radial losses of the atom for the chosen parameters. The storage 
time is obtained by fitting an exponential decay to the atomic storage 
probability. The first millisecond is dominated by a rapid loss of 
atoms, presumably those that are not injected in one of the central 
antinodes. Therefore, the first millisecond of data is disregarded 
(fraction of remaining atoms, ~55%). 
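
A minimal sketch of such a storage-time estimate is given below. It assumes the storage probability is available as a list of (time, probability) samples and uses a log-linear least-squares fit, which is a stand-in for whatever fitting routine was actually used; the function name and the `t_min_ms` parameter are hypothetical.

```python
import math
from typing import Sequence, Tuple


def fit_storage_time(times_ms: Sequence[float], survival: Sequence[float],
                     t_min_ms: float = 1.0) -> Tuple[float, float]:
    """Fit P(t) = A * exp(-t / tau) to samples with t >= t_min_ms.

    Samples before t_min_ms (the initial rapid loss) are ignored.
    The fit is an unweighted least-squares line through (t, ln P).
    Returns (tau_ms, A).
    """
    points = [(t, math.log(p)) for t, p in zip(times_ms, survival)
              if t >= t_min_ms and p > 0.0]
    if len(points) < 2:
        raise ValueError("need at least two usable samples")

    n = len(points)
    mean_t = sum(t for t, _ in points) / n
    mean_y = sum(y for _, y in points) / n
    s_tt = sum((t - mean_t) ** 2 for t, _ in points)
    s_ty = sum((t - mean_t) * (y - mean_y) for t, y in points)

    slope = s_ty / s_tt                  # equals -1 / tau
    intercept = mean_y - slope * mean_t  # equals ln A
    return -1.0 / slope, math.exp(intercept)
```

For noisy data a weighted or nonlinear fit would be preferable, because taking the logarithm distorts the uncertainties of small probabilities.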

Figure 3 shows the atomic storage probability as a function of time 
(plotted in the inset) and the average storage time as a function of 

Figure 3 | Manoeuvring of a single atom using real-time feedback. Average 
storage time plotted as a function of exposure time for two feedback 
strategies. The solid lines show the results of Monte Carlo simulations for the 
inverted (magenta) and normal (blue) feedback strategies. The 
corresponding experimental data points are plotted in the same colours. The 
dotted green and red lines show the experimental storage times without 
feedback for high and, respectively, low toroidal-trap power, setting the 
boundary between feedback cooling (longer storage time) and feedback 
heating (shorter storage time). The standard deviations (not shown) 
obtained from exponential fits to the decay (see text) are less than 0.18 ms. 
The inset shows the experimental atom-loss dynamics without feedback 
(green and red dashed lines, colour-coded as in main figure) and illustrates 
how the feedback changes the slope of the exponential decay. Whereas the 
slope is less steep for normal feedback (T = 17 µs; blue), it is steeper for 
inverted feedback (T = 25.5 µs; magenta). 


exposure time, T, ranging from a few microseconds to T ~ 140 µs or, 
in other words, from a small fraction of the oscillation period to 
about three-eighths of it. The dotted red and green lines are the 
storage times with the toroidal trap at low and, respectively, high 
power without feedback. When T is less than ~7 µs, the noise 
dominates the signal and the algorithm mainly reacts to the random 
arrival of photons. This results in a heating of the atom and a decrease 
in the storage time to below the zero-feedback value, for both normal 
(blue points) and inverted (magenta points) feedback. Increasing 
the exposure time, however, increases the storage time, with a 
maximum at T ~ 15 µs for normal feedback. This is remarkable, 
given that only a few photons have been detected. In contrast, 
applying inverted feedback at this value of T leads to a storage time less 
than the value obtained without feedback. Here we are heating the 
atom by acting out of phase with its motion. Furthermore, we see that 
inverted feedback turns into a cooling mechanism for longer 
exposure times, reaching a maximum at T ~ 90 µs. The maximum storage 
time for inverted feedback is less than that obtained for normal 
feedback. This directly shows that it is important to apply the feed- 
back before the atomic motion becomes unpredictable. 

To understand the data better, we performed Monte Carlo simula- 
tions of the experiment. As shown by the solid lines in Fig. 3, there is 
good agreement between the simulations and the measurements. The 
storage times are well reproduced if we add a loss mechanism other 
than cavity-induced heating. The main reason for this loss is the 
possibility of off-resonant pumping of the atom into the ‘dark’ 
hyperfine state, in which F = 2 (ref. 19). We analysed this effect by adding a 
repumper laser perpendicular to the cavity axis. From the increase in 
the storage time without feedback, we can estimate the additional loss 
rate to be between 1/5.5 ms⁻¹ and 1/14 ms⁻¹. Including a rate of 
1/7 ms⁻¹ in the simulation shows good agreement with the 
experimental data. We also found experimentally that the storage time with 
feedback increases to ~15 ms when the repumper laser is added. 

We next analyse the atomic motion by imposing limits on the 
average transmitted power (and, thus, on the average atom–cavity 
coupling strength) in the data evaluation”. Photon correlations now 
reveal further information on the dynamics of atomic motion, as 
shown in Fig. 4. Here photon bunching is caused by fluctuations in 
the coupling strength. For measurements with a weak atom–cavity 
coupling, corresponding to a transmission of up to 0.6 times that of 
the empty cavity, the oscillation period is ~2 × 260 µs (dashed line). 
If the transmission is restricted to be less than 0.05 times that of the 
empty cavity, the oscillation period decreases to ~2 × 180 µs (solid 
line). Comparison with Fig. 3 therefore shows that the maximum in 
the average storage time at ~90 µs corresponds to one-quarter of the 
oscillation period for a well-coupled atom. A further comparison 
with simulations shows that the feedback can keep the atom close 
to the cavity axis, with an average excursion of less than 4.5 µm. This 
is a factor of ~2 less than the value obtained without feedback. We 
note that the crossing from in-phase behaviour to out-of-phase 
behaviour occurs for T ~ 45 µs (Fig. 3), which is one-eighth of the 
oscillation period for a well-coupled atom. In addition, the random 
character of the atomic motion is visible in the damping of the 
correlation function, indicating a decoherence time of ~200 µs. 
The dependence of the oscillation period on the localization is shown 
in the inset of Fig. 4. It shows the anharmonicity of the atomic 
motion; a harmonic potential would result in a horizontal line. 
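
The timescale comparison made in this paragraph is simple arithmetic on the oscillation period of a well-coupled atom. Written out, using only the values quoted in the text:

```latex
T_{\mathrm{osc}} \approx 2 \times 180\,\mu\mathrm{s} = 360\,\mu\mathrm{s},
\qquad
\tfrac{1}{4}\,T_{\mathrm{osc}} \approx 90\,\mu\mathrm{s},
\qquad
\tfrac{1}{8}\,T_{\mathrm{osc}} \approx 45\,\mu\mathrm{s}
```

The first fraction matches the exposure time at which inverted feedback gives its maximum storage time, and the second matches the crossover between in-phase and out-of-phase behaviour.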
Further improvement of the feedback algorithm is possible and has 
already been realized by combining a fast reaction time with smooth 
switching behaviour: a fast loop with a short exposure time, 
T_short = 8.5 µs, is responsible for switching the toroidal trap to low 
power whenever the atom is observed to move towards the cavity axis 
(n₁ > n₂). Proper switching of the trap back to high power when the 
atom is close to the cavity axis is more critical, as here the photon flux 
from the cavity is low. To improve the signal-to-noise ratio, we added 
a loop with a longer exposure time, T_long = 34 µs, corresponding to 
about T_osc/8. The trap is switched back to high power only if both 
loops register n₁ ≤ n₂. Using this improved algorithm, we increased 
the average storage time to about 24 ms, with maximum observed 
trapping times exceeding 250 ms. With the repumper laser on, the 
overall improvement factor of the storage time with feedback (24 ms) 
relative to that without feedback (<6 ms) is larger than four. Our 
simulations indicate that storage times of 0.5 s can be realized when 
spurious photon clicks (in our experiment mainly stemming from 
repumper light scattered from the edges of the cavity mirrors) are 

Figure 4 | Dynamics of atomic motion from photon correlation 
measurements. The photon correlation function, g⁽²⁾(τ), is shown as a 
function of the correlation time, τ, for strongly coupled atoms (solid line) 
and loosely coupled atoms (dashed line). The atomic motion leaves its 
signature in the correlations, allowing the determination of the oscillation 
period as well as the decoherence time of the oscillation, ~200 µs. The 
standard deviations (not shown) of the g⁽²⁾(τ) data are typically less than 
0.15. The inset shows the dependence of the oscillation period on the 
transmission normalized using the empty-cavity transmission, 
characterizing the anharmonicity of the trapping potential. 

eliminated. This would make the feedback scheme compatible with 
state-of-the-art laser cooling techniques, but with the advantage that 
one-dimensional optical access is sufficient for three-dimensional 
control. 
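
The improved two-loop algorithm described above can be sketched as a small decision function. This is an illustration under stated assumptions, not the authors' implementation: the function and argument names are hypothetical, each loop is assumed to supply its own pair of consecutive photon counts (earlier count first), the condition for switching back to high power is taken to be n₁ ≤ n₂ in both loops, and the trap is assumed to hold its current power when neither rule fires.

```python
def two_loop_trap_power(current: str,
                        short_n1: int, short_n2: int,
                        long_n1: int, long_n2: int) -> str:
    """Return the new trap power ("low" or "high") for the two-loop scheme.

    short_*: consecutive counts from the fast loop (short exposure time).
    long_*:  consecutive counts from the slow loop (long exposure time).
    """
    # Fast loop: the atom is seen moving towards the cavity axis,
    # so the trap is switched to low power immediately.
    if short_n1 > short_n2:
        return "low"
    # Here the fast loop already registers n1 <= n2; switch back to high
    # power only if the slower, less noisy loop agrees.
    if long_n1 <= long_n2:
        return "high"
    # Loops disagree: keep the present setting (assumption of this sketch).
    return current
```

Requiring agreement of both loops before raising the power trades a little reaction speed for robustness against spurious clicks when the photon flux is low.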

The successful realization of feedback on an a-priori unpredictable 
atomic trajectory shows that reliable position information can be 
obtained from continuous (or quasi-continuous) measurements. 
Once extended into the quantum domain, this might make it possible 
to stabilize the quantum state of a trapped particle or observe the 
quantum Zeno effect for a free particle*'. Additional feedback might 
then make it possible to steer an individual quantum trajectory with a 
precision ultimately determined by Heisenberg’s uncertainty relation. 


Received 17 August; accepted 7 October 2009. 

Photon-induced near-field electron microscopy 


Brett Barwick’, David J. Flannigan’ & Ahmed H. Zewail’ 


In materials science and biology, optical near-field microscopies 
enable spatial resolutions beyond the diffraction limit'’, but they 
cannot provide the atomic-scale imaging capabilities of electron 
microscopy’. Given the nature of interactions** between electrons 
and photons, and considering their connections” through nano- 
structures, it should be possible to achieve imaging of evanescent 
electromagnetic fields with electron pulses when such fields are 
resolved in both space (nanometre and below) and time (femto- 
second)''*. Here we report the development of photon-induced 
near-field electron microscopy (PINEM), and the associated 
phenomena. We show that the precise spatiotemporal overlap of 
femtosecond single-electron packets with intense optical pulses at 
a nanostructure (individual carbon nanotube or silver nanowire in 
this instance) results in the direct absorption of integer multiples of 
photon quanta (nħω) by the relativistic electrons accelerated to 
200 keV. By energy-filtering only those electrons resulting from 
this absorption, it is possible to image directly in space the near- 
field electric field distribution, obtain the temporal behaviour 
of the field on the femtosecond timescale, and map its spatial 
polarization dependence. We believe that the observation of the 
photon-induced near-field effect in ultrafast electron microscopy 
demonstrates the potential for many applications, including those 
of direct space-time imaging of localized fields at interfaces and 
visualization of phenomena related to photonics, plasmonics and 
nanostructures. 

Imaging in conventional electron microscopes is based on elastic 
interactions of electrons with matter; that is, with no energy loss or 
gain. With variant techniques, these scattering processes at different 
angles provide structural and bonding information from images and 
diffraction patterns*"*. In contrast, the interaction is inelastic in elec- 
tron spectroscopy and what is observed normally is the energy loss 
due to the valence/core electron excitations of matter; as such, elec- 
tron energy loss spectroscopy (EELS) is a powerful analytical tool. 
When images, diffraction, or electron spectra are time-resolved in 
electron microscopy, photons are typically used to initiate a change 
for the study of ultrafast structural dynamics, which occur on the 
femtosecond and longer timescale'''’. But, before these structural 
changes, electronic distributions are altered, with their dynamical 
changes being on the femtosecond and shorter timescale; they are 
directly the result of photon–matter interaction. Photon–electron 
interactions are of a different nature. 

In free space, an electron cannot absorb a quantum of electro- 
magnetic energy because of the lack of energy-momentum conser- 
vation. However, as suggested in 1933‘, absorption followed by 
stimulated emission can occur when two (counter-propagating) 
photons are used. Indeed, the experimental verification was made 
in 2001°, and earlier in 1988 it was demonstrated® that an intense 
standing optical wave can result in momentum transfer to free elec- 
trons with scattering rates approaching that of the optical frequency. 
These photon-electron interactions are basic to attosecond pulse 
generation’ and to multiphoton harmonics and the laser-assisted 

Figure 1 | Electron energy spectra of carbon nanotubes irradiated with an 
intense fs laser pulse at two different delay times. a, The zero-loss peak 
(ZLP) of the 200 keV electrons (black curve) taken when the electron packet 
arrives before the femtosecond pulse; in this spectrum only the plasmon 
peaks are present (see text). The energy spectrum at coincidence of the two 
pulses (t = 0 fs; red curve) displays the multiple quanta of photon 
absorption/emission. Inset, the positive energy gain region multiplied by 5 
for the t = 0 spectrum, indicating that absorption of at least eight quanta of 
photon energy can be observed at maximum spatiotemporal overlap. 

b, Magnified view of the electron energy spectrum obtained at t = 0. The 
energy is given in reference to the loss/gain of photon quanta by the electrons 
with respect to the zero-loss energy. 


surface photoelectric effect*. In 1966, the effect of lattice vibrations 
in electron energy loss and gain was reported'’, and more recently it 
was suggested” that photon-electron interactions can similarly be 
exploited in various connections. Of special interest is the possible 
use of continuous-wave (CW, no time resolution) lasers to provide 
high resolution spectra of plasmons and other features'®. As shown 
below, only when the field is induced and probed on the ultrashort 
timescale would it be visualized and controlled for applications in 
imaging and spectroscopy. 

Unlike all previous ultrafast electron microscopy (UEM) studies 
from this laboratory’, the imaging and spectroscopy methods 
reported here rely on the strong interaction between photons and 
electrons. Furthermore, because structural dynamics commence after 

Figure 2 | Photon-induced near-field electron microscopy of an individual 
nanotube. a, Top, bright-field image shown for reference (time-averaged, 
unfiltered); the average diameter across the tube is 147 ± 20 nm; scale bar, 
500 nm. Middle, the spatial field gradient (of length scale L) in image counts. 
Bottom, the decay of counts with time T. See text for details. The profile in 
red shown in the middle plot was obtained from the t = 0 frame, and the 
black exponential curve is displayed simply to illustrate the typical length 
scale of an evanescent field. b, The nine energy-filtered UEM images 
acquired by using only the electrons that have gained energy (up to n = 4) 


the photon-induced near-field (PIN) effect diminishes, here after 
400 fs, imaging of the nanostructure electronic properties beyond this 
time is no longer possible. Significantly, on this electron–photon 
interaction timescale, and using intense pulses with peak irradiances 
of the order of 100 GW cm⁻², the 200 keV electron packets lose and 
gain energy in discrete quanta that are integer multiples of the tuned 
photon energy. As reported below, at least eight photons of absorp- 
tion/emission were observed despite the fact that the interaction time 
with the nanostructure is only a few hundred attoseconds, given the 
electron speed and path length in the nanostructure. Moreover, by 
energy-filtering the zero-loss peak (ZLP), we are able to obtain the 
images formed as a result of elastic scatterings; remarkably, when 
selecting only the electrons that have gained quanta of photon energy, 
the evanescent electric field was visualized in real space images of the 
nanostructure. The field polarization and temporal behaviour are 
different from those of bulk structural transformations. 
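
The quoted interaction time of a few hundred attoseconds can be checked from the relativistic speed of a 200 keV electron. The sketch below uses the standard electron rest energy and the speed of light; the function names are hypothetical, and the path length is an assumption, taken here as the roughly 100 to 140 nm nanostructure dimensions mentioned in the text.

```python
import math

ELECTRON_REST_ENERGY_KEV = 510.99895      # m_e c^2
SPEED_OF_LIGHT_M_PER_S = 299_792_458.0


def electron_beta(kinetic_energy_kev: float) -> float:
    """Speed of an electron in units of c for a given kinetic energy."""
    gamma = 1.0 + kinetic_energy_kev / ELECTRON_REST_ENERGY_KEV
    return math.sqrt(1.0 - 1.0 / gamma ** 2)


def transit_time_as(path_length_nm: float,
                    kinetic_energy_kev: float = 200.0) -> float:
    """Time in attoseconds for the electron to cross path_length_nm."""
    speed = electron_beta(kinetic_energy_kev) * SPEED_OF_LIGHT_M_PER_S
    return path_length_nm * 1e-9 / speed * 1e18


if __name__ == "__main__":
    print(f"beta at 200 keV: {electron_beta(200.0):.3f}")
    for length_nm in (100.0, 140.0):  # assumed path lengths
        print(f"{length_nm:.0f} nm -> {transit_time_as(length_nm):.0f} as")
```

At 200 keV the electron travels at about 0.7c, so both assumed path lengths give transit times of several hundred attoseconds.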

The experiments were performed on an individual and a collection of 
multiwalled carbon nanotubes with diameters of ~140 nm and lengths 
of ~7 µm (Aldrich, >90% purity), and on silver nanowires with 
diameters of ~100 nm and lengths ranging from 2 to 20 µm (ref. 16). Both 
femtosecond-resolved electron energy spectra and energy-filtered 
photon-induced images were recorded in our second generation, ultra- 
fast electron microscope (UEM-2). All experiments reported here were 
conducted in the single-electron regime (0.1–1 e⁻ per packet at the 
detector) in order to eliminate space-charge effects; a repetition rate 
of 500 kHz and a fluence of 14 mJ cm⁻² for carbon nanotubes 
(1.2 mJ cm⁻² for silver nanowires) were typically used. The output from 
a laser emitting a train of 220 fs pulses centred at 1,038 nm was split into 
two arms, one of which was frequency doubled and used to excite the 
nanostructure, and the other was frequency tripled and used to generate 
the electron packets at the photocathode source in UEM-2. The 
femtosecond timing between the frequency doubled laser pulse and 

relative to the ZLP; for clarity, the images are displayed in false colour (see 
bar at lower right). Blue indicates regions of the CCD where no counts were 
recorded because we selected only the +nħω region. The time of arrival of 
the electron packet at the nanotube relative to the clocking laser pulse is 
shown in the upper left corner of each image. The electric field of the clocking 
laser pulse was linearly polarized perpendicular to the long-axis of the 
nanotube (compare Fig. 3). The counts indicated in red represent the fields 
created by the femtosecond pulse around the surface of the nanostructure 
and their decay with time. 


the electron packet at the specimen was controlled by an optical delay 
line; the apparatus has been described in detail elsewhere''””. 
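
The photon energies of the two arms follow directly from the 1,038 nm fundamental. A short check is sketched below, using the standard conversion hc ≈ 1239.84 eV nm; the function names are hypothetical.

```python
HC_EV_NM = 1239.841984  # Planck constant times speed of light, in eV nm


def harmonic_wavelength_nm(fundamental_nm: float, order: int) -> float:
    """Wavelength of the given harmonic of the fundamental."""
    return fundamental_nm / order


def photon_energy_ev(wavelength_nm: float) -> float:
    """Photon energy in eV for a vacuum wavelength in nm."""
    return HC_EV_NM / wavelength_nm


if __name__ == "__main__":
    for name, order in (("doubled (excitation)", 2), ("tripled (photocathode)", 3)):
        wavelength = harmonic_wavelength_nm(1038.0, order)
        print(f"{name}: {wavelength:.0f} nm, {photon_energy_ev(wavelength):.2f} eV")
```

The frequency-doubled arm comes out at 519 nm, or about 2.4 eV per photon, which is the photon energy that sets the spacing of the discrete peaks in the electron spectra.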

Shown in Fig. 1 are temporally resolved electron energy spectra 
obtained from the carbon nanotubes irradiated with the intense femto- 
second laser pulse. For comparison, two separate electron spectra are 
shown (Fig. 1a): one obtained when the electron packet arrived before 
the femtosecond pulse, which we call the t = −2 ps spectrum, and a 
second one recorded when the electron and photon pulses were 
configured for a maximum overlap, the t = 0 fs spectrum. Upon 
inspection of the change, one immediately notices that at the maximum 
overlap (t = 0) the spectrum consists of discrete peaks of decreasing 
intensity on both the lower- and higher-energy side of the ZLP. For the 
t = −2 ps spectrum, however, the discrete peaks are absent and only 
the π and π+σ plasmon peaks at 6 and 25 eV, respectively, are 
observed"*. 

A magnified view of the electron spectrum obtained at t = 0 reveals 
that the discrete peaks on both sides of the ZLP occur at integer 
multiples of the photon energy of the exciting femtosecond pulse 
(2.4 eV; Fig. 1b). From these spectra, it is apparent that the discrete 
peaks occur as a consequence of the interaction of the 200 keV ultra- 
fast electron packet with the 2.4eV femtosecond photon pulse. On 
this timescale, the large influence of the PIN effect is illustrated by the 
substantial decrease in the ZLP intensity at maximum overlap. From 
the recorded electron-energy spectra, we found that electrons of the 
ultrafast packets can absorb more than eight photons during the brief 
interaction with the nanostructure (Fig. 1a inset). It is important to 
note that the spectra shown in Fig. 1 (loss/gain) are observed only in 
the presence of the nanostructure. 

On the length scale (d) of the nanostructure, relative to that of 
the photon (wavelength λ), the interaction between photons and 
electrons (that is, free–free transitions) is greatly enhanced by the 
evanescent field that is created through the excitations of the carbon 
nanotube, and similarly for the silver nanowires, without which 
energy-momentum conservation is impossible'*’’. The probability 
of electron–photon coupling in the presence of a third body (for 
example, atom, molecule, or a surface) increases as the electron 
energy increases for a fixed laser intensity and wavelength’, and such 
a characteristic is ideal for UEM at 200 keV. Free–free transitions in 
the electron continuum, without perturbations from the third body, 
become descriptive of the process when the electron has a much 
higher energy than the photon**®*!. Whereas plasmonic fields 
induced by the femtosecond pulse can follow the laser electric field”, 
the relaxation time for metallic nanoparticles is of the order of tens to 
hundreds of femtoseconds, depending on the damping (recombina- 
tion) processes involved”*. Multiwalled carbon nanotubes are metallic 
in nature”, as are silver nanowires, and for such metallic nano- 
structures absorption is enhanced at lower energies”. 

In Fig. 2, the effect of energy-filtering on imaging is displayed at 
different times. By using an energy filter (n= 1 to 4; 10 eV total 
width) to select only those electrons that have absorbed energy to 
form an image, the evanescent field generated by the femtosecond 
excitation pulse became evident in real-space images of the isolated 
(individual) nanotube. Further, by varying the arrival time of the 
electron packet at the carbon nanotube relative to the clocking femto- 
second pulse, the ultrafast evolution of the evanescent field was fol- 
lowed in real time. As can be seen in Fig. 2, image counts appear only 
within the local vicinity of the surface of the carbon nanotube; no 
energy gain occurs far from the nanostructure or within the tube 
itself. 

The energy-filtered image generated by selecting only 
energy-gained electrons and obtained at t = −600 fs shows almost no counts. 
As the temporal overlap increases, however, the counts due to the 
evanescent optical field increase and reach a maximum at t = 0 
(that is, maximum overlap) before decreasing again to almost zero at 
t = +600 fs. In addition to revealing the rise time of the evanescent 
field to be much less than one picosecond, the sequence of images in 
Fig. 2b also shows that the field extends at the interface to ~50 nm (1/e) 
into vacuum on either side of the nanotube. The image length scale is 
consistent with theoretical considerations of optically excited plas- 
mons”. The time necessary for the image counts to decay from the 
maximum to minimum values, normalized by the maximum change 
in counts per unit time (Fig. 2a, bottom) is 130 fs, reflecting the rate of 
change, as discussed below. A movie of the time dependent behaviour of 
the carbon nanotubes is included in the Supplementary Information. 

The ultrafast response shown in the discrete energy gain and loss 
(that is, bands on both sides of the ZLP; Fig. 1) can be quantified in 
energy and time space (Fig. 3). In order to obtain the intensity profile, 
each energy spectrum was fitted to a series of Gaussians having the 
form: 


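$$
\Delta I(E) = -\frac{2\alpha}{\sqrt{2\pi}\,\sigma}\, e^{-E^{2}/2\sigma^{2}} + \sum_{\pm n} \frac{a_{n}}{\sqrt{2\pi}\,\sigma}\, e^{-(E - n\hbar\omega)^{2}/2\sigma^{2}} \qquad (1)
$$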


where α is a sum over a_n, the amplitude of the nth photon process, 
and σ reflects the energy width. A typical fit of equation (1), which 
has also been invoked in photoelectron studies*, to the observed 
spectrum is shown in Fig. 1b. In Fig. 3a, the temporal dependence 
of different sidebands and the ZLP is plotted on a log scale. With 
Gaussian analysis in the time domain, we obtained the time constants 
involved. For the ±3ħω peaks, the fitted time constant is 420 fs, which is 
a direct result of the convolution of the femtosecond excitation pulse, 
the electron packet, and response time of the evanescent optical field; 
the femtosecond optical pulse duration is 220 fs (ref. 11) and energy-filtering 
may be significant in reducing electron pulse energy dispersion”’. All 
peaks were fitted similarly, giving a range of fitted time constants of 510 ± 90 fs. 
Because of the geometry involving the two pulses used in UEM-2, the 
(axial) group velocity mismatch is irrelevant here, as it results in a 
dispersion of ~1 fs over the 100 nm path length. 
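
The Gaussian model of equation (1) can be evaluated with a few lines of code. The sketch below is an assumption-laden reading of that equation rather than the authors' fitting code: it takes the sidebands to be normalized Gaussians of common width σ centred at ±nħω, uses the same amplitude a_n for the gain and loss side, and depletes the zero-loss peak by 2α with α the sum of the a_n. The function names and the example amplitudes are hypothetical.

```python
import math
from typing import Sequence


def normalized_gaussian(energy_ev: float, center_ev: float, sigma_ev: float) -> float:
    """Unit-area Gaussian of width sigma_ev centred at center_ev."""
    z = (energy_ev - center_ev) / sigma_ev
    return math.exp(-0.5 * z * z) / (math.sqrt(2.0 * math.pi) * sigma_ev)


def sideband_model(energy_ev: float, amplitudes: Sequence[float],
                   sigma_ev: float, photon_energy_ev: float = 2.4) -> float:
    """Change in the spectrum at energy_ev relative to the zero-loss energy.

    amplitudes[k] is a_n for n = k + 1 and is used for both +n and -n.
    """
    alpha = sum(amplitudes)
    # Depletion of the zero-loss peak by the total sideband population.
    value = -2.0 * alpha * normalized_gaussian(energy_ev, 0.0, sigma_ev)
    # Gain (+n) and loss (-n) sidebands at integer multiples of the photon energy.
    for n, a_n in enumerate(amplitudes, start=1):
        for sign in (+1, -1):
            center = sign * n * photon_energy_ev
            value += a_n * normalized_gaussian(energy_ev, center, sigma_ev)
    return value


if __name__ == "__main__":
    example_amplitudes = [0.3, 0.2, 0.1]  # illustrative values only
    for energy in (-4.8, -2.4, 0.0, 2.4, 4.8):
        print(f"{energy:+.1f} eV: {sideband_model(energy, example_amplitudes, 0.5):+.4f}")
```

With this normalization the model integrates to zero over energy, so electrons removed from the zero-loss peak reappear in the sidebands.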

Figure 3 | Temporal response and polarization dependence of the imaged 
interfacial fields. a, Temporal dependence of the ZLP and representative 
peaks of ±nħω plotted on a log scale. Given are the time constants of the 
femtosecond optical pulse and of the fitted transient; see text. The t = 0 
position is determined by the temporal response of the ZLP’’. Inset, linear 
contour plot of the data fitted by the Gaussians. b, Two images taken when 
the E-field polarization of the femtosecond laser pulse is parallel to (left 
image) and perpendicular to (right image) the long-axis of the nanotube. 
Both polarization frames were taken at t = 0, when the interaction between 
electron, photons and the evanescent field is at a maximum. 


Besides the temporal and energy domain observations, we also 
examined the spatial distribution of the near field from images taken 
for different polarizations of the femtosecond pulse relative to the 
orientation of an individual nanotube (or wire). Shown in Fig. 3b are 
UEM images obtained at t= 0 with the E-field of the femtosecond 
laser pulse polarized either parallel (left frame) or perpendicular 
(right frame) to the long axis of the carbon nanotube. These frames 
display striking changes in the images: when the laser polarization 
is positioned appropriately relative to the nanotube orientation, a 
spatial enhancement of the evanescent field is observed in the UEM 
images. This is because for this case the confinement is in the regime 
of d < λ. On the other hand, for the other polarization, when d > λ, 
the tip enhancement is seen when the polarization changes by 90°. 
For this case, certain spatial modes may be formed with unique dis- 
tributions (Fig. 3 and Supplementary Information). The precise dis- 
tribution of the field is dependent upon the nanoscale geometry of 
the specimen”*”», and the fact that the apex of the tip has a decrease in 
counts at perpendicular polarization is consistent with nanometric- 
scale field calculations”’. 

The above experiments on carbon nanotubes were repeated using 
silver nanowires, obtaining similarly the filtered energy gain images, 
the electron spectra, and the polarization dependence (Supplementary 
Fig. 1). The irradiance needed, however, was an order of magnitude 
lower (10 GW cm⁻²), consistent with the stronger near-field formed 
in the metallic nanowire and with the difference in material property. 

Figure 4 | Physical depiction of the interaction between the electron, 
photon and the evanescent field. a, A frame when the electron packet 
arrives at the nanotube before the femtosecond laser pulse (t < 0); no 
spatiotemporal overlap has yet occurred. b, The precise moment at t = 0 
when the electron packet, femtosecond laser pulse and evanescent field are at 
maximum overlap at the carbon nanotube. c, Illustration of the process 


The polarization dependence was the same as that of the carbon 
nanotubes, owing to the similarity of the geometrical structure of 
the nanotubes and nanowires. 

From the above UEM results, the PIN picture can be illustrated by 
considering the spatiotemporal coordinates of the three bodies 
involved (Fig. 4). At negative times (t < 0), we can visualize the 
femtosecond laser pulse as not impinging on the nanostructure, 
and no discrete (nħω) electron energy gain or loss would be observed. 
When the laser pulse encounters the nanostructure (t = 0), it creates 
the near-field excitations, and this interaction causes the surface field 
to oscillate with the electric field of the laser**. Because the nanotube 
diameter (~140 nm) is much less than the wavelength of the light 
(519 nm), the field is confined (d < λ) by the dimensions of the tube 
(wire), and this confinement sets up an oscillating dipole in the 
structure. The intensity of the evanescent field extends beyond the 
structure of the nanotube and falls off exponentially with distance 
from the surface*”’. Thus, evanescent fields effectively mediate the 
interaction between the 200 keV electron and the 2.4 eV photons in 
the femtosecond excitation pulse, but the absorption/emission pro- 
cesses only occur when both the electron and photon are overlapped 
in space at the nanostructure and in time at t= 0. 

The orders-of-magnitude enhancement achieved in UEM may be 
appreciated by comparison with the time-averaged, CW mode of 
excitation. For a tightly focused CW laser (10⁶ W cm⁻²), the number 
of excitations on the timescale of the field is nearly five orders of 
magnitude less than achieved in UEM using ~100 GW cm⁻² 
irradiance. Further, for CW powers of about 10 W, it would be necessary 
for the nanostructures to dissipate the energy without significant 
structural damage. In UEM, typically the average power is of the 
order of 100 mW. Perhaps most importantly, the precise overlap of 
pulses in UEM allows for signal acquisition times of only a few 
seconds, as every electron contributes to the gain/loss signal on the 
timescale of the field’s existence. In contrast, for CW electron spec- 
troscopy, the signal will be overwhelmed by a background whose 
magnitude is conditioned by the repetition rate and other factors. 
Finally, we note that the process of ±nħω absorption/emission 
reported here takes place for each single-electron, timed packet. 
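
The pulsed-versus-CW comparison can be made concrete with a rough estimate of the peak irradiance from the fluence and pulse duration quoted earlier. This is only an order-of-magnitude sketch: it divides fluence by pulse duration and ignores pulse-shape factors, it assumes the 220 fs duration also applies to the frequency-doubled excitation pulse, and the CW irradiance of 10⁶ W cm⁻² is passed in as an assumed value. The function names are hypothetical.

```python
import math


def peak_irradiance_w_per_cm2(fluence_j_per_cm2: float,
                              pulse_duration_s: float) -> float:
    """Rough peak irradiance: energy per area divided by pulse duration."""
    return fluence_j_per_cm2 / pulse_duration_s


def orders_of_magnitude(larger: float, smaller: float) -> float:
    """Base-10 logarithm of the ratio of two positive quantities."""
    return math.log10(larger / smaller)


if __name__ == "__main__":
    pulse_s = 220e-15          # pulse duration from the text (assumed for this arm)
    cw_irradiance = 1e6        # assumed CW irradiance in W cm^-2

    nanotube = peak_irradiance_w_per_cm2(14e-3, pulse_s)
    nanowire = peak_irradiance_w_per_cm2(1.2e-3, pulse_s)

    print(f"carbon nanotubes: {nanotube:.1e} W/cm^2")
    print(f"silver nanowires: {nanowire:.1e} W/cm^2")
    print(f"pulsed vs CW: {orders_of_magnitude(nanotube, cw_irradiance):.1f} orders")
```

The estimate gives several tens of GW cm⁻² for the nanotubes and several GW cm⁻² for the nanowires, consistent with the orders of magnitude stated in the text.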

Photon-induced near-field imaging with electrons is made possible 
by the precise overlap of ultrafast electron packets, intense ultrafast 
laser pulses, and nanostructures. These structure-mediated 
(electron–photon interaction) phenomena, as well as the spatiotemporal 
properties of the evanescent electric fields, can now be imaged in real 
space and on the femtosecond timescale. By knowing the distribu- 
tion of the field and the control over its polarization and temporal 
behaviour, it is possible to explore the nature of interfacial fields and 


during and immediately after the interaction (t > 0) when the electron gains/ 
loses energy equal to integer multiples of femtosecond laser photons. Inset, 
the possible final energies in the continuum due to the free-free transitions 
between the imaging electron and the photons in the femtosecond laser 
pulse. KE, kinetic energy. 


their role in a variety of applications at the nanoscale of materials**. In 
optics, near-field imaging is now known to break the diffraction limit, 
but with limited spatial and temporal resolutions. In the present con- 
tribution, PINEM promises to take the resolutions into the domain of 
ultrafast electron microscopy, the atomic scale. Moreover, with 
PINEM, which exploits inelastic interactions, the real space images 
(and diffraction), which are the result of elastic interactions, can easily 
be obtained by removing the energy filter; the scanning requirement of 
optical near-field methods is not of concern here. Because the method 
involves surface electrons, it also can bring about imaging with sub- 
femtosecond electrons*”””. 
Fault zone fabric and fault weakness


Cristiano Collettini, André Niemeijer, Cecilia Viti & Chris Marone


Geological and geophysical evidence suggests that some crustal 
faults are weak’* compared to laboratory measurements of fric- 
tional strength’. Explanations for fault weakness include the pres- 
ence of weak minerals‘, high fluid pressures within the fault core*? 
and dynamic processes such as normal stress reduction"’, acoustic 
fluidization'' or extreme weakening at high slip velocity’*™. 
Dynamic weakening mechanisms can explain some observations; 
however, creep and aseismic slip are thought to occur on weak 
faults, and quasi-static weakening mechanisms are required to 
initiate frictional slip on mis-oriented faults, at high angles to 
the tectonic stress field. Moreover, the maintenance of high fluid 
pressures requires specialized conditions’> and weak mineral 
phases are not present in sufficient abundance to satisfy weak fault 
models’, so weak faults remain largely unexplained. Here we pro- 
vide laboratory evidence for a brittle, frictional weakening mech- 
anism based on common fault zone fabrics. We report on the 
frictional strength of intact fault rocks sheared in their in situ 
geometry. Samples with well-developed foliation are extremely 
weak compared to their powdered equivalents. Micro- and nano- 
structural studies show that frictional sliding occurs along very 
fine-grained foliations composed of phyllosilicates (talc and 
smectite). When the same rocks are powdered, frictional strength 
is high, consistent with cataclastic processes. Our data show that 
fault weakness can occur in cases where weak mineral phases con- 
stitute only a small percentage of the total fault rock and that low 
friction results from slip on a network of weak phyllosilicate-rich 
surfaces that define the rock fabric. The widespread documenta- 
tion of foliated fault rocks along mature faults in different tectonic 
settings and from many different protoliths*’”-’ suggests that this 
mechanism could be a viable explanation for fault weakening in 
the brittle crust. 

Laboratory measurements on a wide variety of rock types show 
that fault friction μ is in the range 0.6–0.8, independent of rock type,
with the exception of a few weak minerals’. Several lines of evidence
suggest that Byerlee’s friction with μ = 0.6 is applicable to many
faults within the brittle crust”, including in situ stress measurements 
showing that failure occurs on optimally oriented faults with normal 
friction and nearly hydrostatic pore pressure*'. Moreover, the 
absence of moderate-to-large earthquakes along severely misoriented 
faults in both extensional and compressional environments is con- 
sistent with laboratory friction values”. In contrast, a number of 
studies have suggested that some mature crustal faults are weak 
compared to laboratory friction values'**°’. Dynamic weakening 
mechanisms may explain some of these data. High speed friction 
experiments have shown that friction decreases above a threshold 
slip rate'** and such weakening is consistent with observations of 
dynamic rupture'®"’”’. However, a number of observations indicate 
that fault creep, aseismic slip and slip on unfavourably oriented faults 
with respect to the applied stress occur at low resolved shear stress, 
which implies that some faults are statically weak’***”, that is, 


friction is low in the long term including both co-seismic and inter- 
seismic phases. 

To investigate the role of fabrics in long term fault weakening, we 
conducted friction experiments on fault rocks obtained from the 
Zuccale fault, which is a low-angle normal fault exposed on the Isle 
of Elba (Fig. 1) in central Italy. Geological data suggest that the 
Zuccale fault has accommodated a total shear displacement of 
6-8 km along a dip angle of about 15° in a stress field with a vertical 
maximum compression; that is, the fault was weak”. The fault zone is 
characterized by a foliated and lithologically heterogeneous fault core 
several metres thick. The adjacent hangingwall and footwall rocks 
deformed predominantly by brittle cataclastic processes. Within 
the fault core, however, a significant amount of deformation was 
accommodated within highly foliated phyllosilicate-rich horizons 
that represent deformation processes that occurred at less than 
8 km of crustal depth”. The foliated microstructure probably formed 
in the early geological stages of fault activity by dissolution and pre- 
cipitation processes. Once formed, the phyllosilicate-rich network 
deforms predominantly by frictional sliding along the phyllosilicate 
foliae™*. 

We chose rock samples from two foliated zones (Fig. 1) for detailed 
friction studies (see Methods). Zone L2 is composed of massive 
calcite-rich portions and calcite sigmoids dispersed within a fine-
grained (<2 μm) foliation made of tremolite and phyllosilicate
(smectite, talc and minor chlorite). Zone L3 contains both large 
crystals of calcite and agglomeration of tremolite fibres in random 
orientation, within a foliation made of fine-grained tremolite and 
phyllosilicates (smectite, talc and minor chlorite). We collected 
undisturbed samples of the fault rocks and cut them into wafers 
0.8–1.2 cm thick and 5 cm × 5 cm in area. We refer to these as solid
experiments. We also carried out experiments on powders obtained
from crushing and sieving (<150 μm) the solid samples. With this
approach we directly compare the frictional properties of foliated 
fault rocks and their powdered equivalents, which have identical 
mineralogical composition (Table 1). 

Frictional sliding experiments were carried out in the double direct 
shear configuration (Fig. 2a inset) at 25 °C and over a range of normal 
stresses from 10 to 150 MPa and shear slip velocities of 1 to 300 μm s⁻¹
(see Methods). All experiments are characterized by an initial strain 
hardening, where the shear stress increases rapidly during elastic load- 
ing, before a yield point, followed by shear at a steady-state value of 
frictional stress (Fig. 2a). We measured the steady state, residual, fric- 
tional shear stress for intact fault rocks and powders at each normal 
stress. Each rock type plots along a line consistent with a brittle failure 
envelope (Fig. 2b), but the solid wafers are much weaker than their 
powdered analogues. In particular, the powders show friction of 
about 0.6, whereas the foliated rocks have significantly lower values 
(0.45–0.20). We note that at each normal stress, foliated fault rocks
have a friction coefficient that is 0.2–0.3 lower than the powders made
from them (Fig. 3). The majority of our experiments were conducted 
Figure 1 | Example of a foliated low-angle normal fault. a, Map and cross-
section of the Zuccale fault (ZF) as exposed on the Isle of Elba, Central Italy.
The Zuccale fault is an exhumed, ancient low-angle normal fault belonging 
to the Apenninic system. In the tectonically active area of the Apennines, to 
the east, a similar low-angle normal fault is producing microseismicity at 


without water and at room temperature, but we also ran experiments 
under water-saturated conditions and these indicate further weaken- 
ing for intact fault rocks (for example, Fig. 3). 

Microstructural studies of the tested rocks show that the sliding 
surfaces of the foliated solid wafers are located along the pre-existing 
very-fine-grained foliation made of tremolite and phyllosilicates 
(Fig. 4a and c). Transmission electron microscope (TEM) images 
show that the foliation is made of tremolite clasts surrounded by 
an interconnected network of phyllosilicates, including smectite 
and talc (Fig. 4e and f). The through-going microstructure is affected 
by translation and rotation of the (001) phyllosilicate layers with 
frequent interlayer delaminations, resulting in talc grain size reduc-
tion, down to 20 nm in thickness. This anastomosing network of
phyllosilicate lamellae appears to provide a multitude of possible 
slipping planes for frictional shear under a low stress. In contrast, 
our experiments conducted on powder composed of the same mate- 
rials indicate that much of the deformation occurs along zones charac- 
terized by grain-size reduction and affected by shear localization along 
R1, Y and B shears (Fig. 4b, for example**°). The slipping zones of
powdered samples are characterized by abundant calcite clasts 
immersed in a groundmass consisting of fine-grained tremolite and 
subordinate phyllosilicates (Fig. 4d). 

Although the foliated wafers of intact fault rock and their powders 
have identical mineralogical compositions, the foliated samples are 
much weaker than their powdered analogues. On the basis of micro- 
structural studies, we propose that weakness of the foliated fault 
rocks is due to the reactivation of pre-existing fine-grained and phyl- 
losilicate-rich surfaces that are absent in the powders. The slightly 


Table 1 | Experimental details and mineral composition of the rocks tested.

Experiment  Sample        Normal stress (MPa)  Mineralogy (%)
                                               Calcite  Tremolite  Smectite  Talc  Chlorite
P2045       Wafer L2      10; 20; 35; 50       43       36         14        6
P2048       Wafer L3      10; 20; 35; 50       39       26         19        15
P2049       Powder L3     10; 20; 35; 50       39       26         19        15
P2050       Powder L3     75; 100; 150         39       26         19        15
P2052       Wafer L3      75; 100; 150         39       26         19        15
P2053       Wafer L2      75; 100; 150         43       36         14        6
P2054       Wafer L2      10; 20; 35; 50       43       36         14        6
P2056       Powder L2     10; 20; 35; 50       43       36         14        6
P2057       Wafer-wet L2  10; 20; 35; 50       43       36         14        6
P2066       Powder L2     75; 100; 150         43       36         14        6


Mineral abundances were evaluated by TG-MS and XRPD (see Methods and Supplementary 
Fig. 1). 
depths’ of 4-16 km. The geological cross-section AB through central and
eastern Elba highlights the low-angle geometry™*. b, Outcrop photograph of 
the fault structure and locations of zones L2 and L3 where samples were 
collected. c, Detail of the foliated fault rock in L2. (Camera lens cap shown
for approximate scale; diameter 4 cm.) 


higher friction coefficient of fault zone L2 compared to L3 (0.31 
versus 0.25) is probably due to the presence of foliation-perpendic- 
ular calcite-filled fractures that reduce the interconnectivity of the 
microstructure. The frictional strength of the solid wafers is com- 
parable to that of pure talc at similar sliding conditions’® even with 
the presence of 65-80% of strong calcite and tremolite minerals up to 
Figure 2 | Friction experiments. a, The inset shows the double-direct shear
geometry used to shear solid and powdered samples of fault rock. The main
plot shows the coefficient of friction μ versus shear displacement at the layer
boundaries for a representative experiment at a normal stress of 150 MPa
(sample L3). Strain hardening occurs initially, after which friction reaches a
steady-state value. We note that sliding friction is significantly lower for the
solid sample than for its powdered equivalent. b, Steady-state shear strength,
measured during frictional sliding, plotted as a function of normal stress.
Data for layers of powdered rock plot along lines with slopes μ = 0.55 and
μ = 0.43 for L2 and L3 respectively. Data for shear of solid fault rock plot
along lines with slopes μ = 0.31 and μ = 0.25 for L2 and L3 respectively.
Figure 3 | Frictional properties of fault rocks and powders made from them.
Steady-state friction plotted as a function of normal stress. The solid samples 
show friction values that are consistently 0.2-0.3 weaker than powders. 
Further weakening for intact fault rocks is achieved under water-saturated 
conditions. 


200 um wide (Table 1). This suggests that weak fault rocks can form 
with even small amounts of phyllosilicates as long as they form an 
interconnected, through-going foliated network. Although our 
experiments show that further weakening along phyllosilicate-rich 
rocks can be achieved by (1) the presence of fluids within the fault 
zone and (2) slow sliding velocities, the average value of μ = 0.25
from our dry experiments at room temperature is sufficient to 
explain the absence of measurable heat flow along weak faults and 
frictional reactivation of faults oriented up to 75° from the maximum 
compressive stress, like the San Andreas or the low-angle normal 
faults of the Apennines. 
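
The link between a low friction coefficient and reactivation of severely misoriented faults can be illustrated with a short calculation. The sketch below is not part of the experimental work reported here: it assumes the standard two-dimensional result for a cohesionless fault, in which the optimal reactivation angle from the maximum compressive stress is 0.5 arctan(1/μ) and frictional lock-up occurs at twice that angle. The function name is hypothetical.

```python
import math


def optimal_and_lockup_angles(mu):
    """Return (optimal, lock-up) reactivation angles in degrees.

    Assumes a cohesionless fault in a 2D stress field: the optimal angle
    between the fault and the maximum compressive stress is
    0.5 * atan(1 / mu), and lock-up occurs at twice that angle.
    """
    optimal = 0.5 * math.degrees(math.atan(1.0 / mu))
    return optimal, 2.0 * optimal


# Byerlee-type friction versus the friction measured on the foliated wafers.
for mu in (0.6, 0.25):
    optimal, lockup = optimal_and_lockup_angles(mu)
    print(f"mu = {mu:.2f}: optimal {optimal:.1f} deg, lock-up {lockup:.1f} deg")
```

Under these assumptions, μ = 0.6 gives a lock-up angle of about 59°, whereas μ = 0.25 moves it to about 76°, in line with reactivation of faults oriented up to 75° from the maximum compressive stress.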

In the core of ancient exhumed fault zones, where much of the 
deformation is accommodated, geological observations suggest that 
pervasive fluid influx can facilitate the dissolution of strong minerals 
and the precipitation of weak and fine-grained secondary 
phases**!”**7, With incremental shearing, these weak phyllosilicates 


Figure 4 | Comparison between solid-foliated and powder sliding surfaces 
in L3. a, c, Solid experiment (P2048), total shear displacement of ~25 mm, 
steady friction coefficient μ = 0.32. The calcite sigmoids are interdispersed
within a fine-grained foliation (see detail in c) made of tremolite and 
phyllosilicates (smectite, chlorite and talc). The slipping processes occur 
along planes localized within the foliation. b, d, Powder experiment (P2049), 
total shear displacement ~25 mm, steady friction coefficient μ = 0.53. The
fault rock shows a cataclastic texture with zones affected by grain-size 
can align to form a foliation that ultimately results in an intercon-
nected network that allows slip to be accommodated along the weak
foliation and reduces the frictional strength of the fault zones**'”**””, 
These observations have been confirmed in laboratory experiments 
using rock analogue materials where the combination of fluid- 
assisted deformation and the development of a phyllosilicate foli- 
ation at low sliding velocities and high shear strains caused major 
weakening of the experimental fault gouge (a decrease in μ from 0.8
to 0.25)**”°. Here, by performing frictional sliding experiments on 
intact wafers of natural fault rocks with a pre-existing phyllosilicate- 
rich foliation, we have demonstrated that the presence of a phyllosi- 
licate foliation in fault zones results in significant fault weakening. 

One important question involves whether fault weakening via 
fabric development in phyllosilicate-rich rocks rules out earthquake-
like frictional instability. Previous works have noted the tendency for
weak materials to exhibit inherently stable frictional slip’*'*”®. 
However, geological investigations have documented the mutual 
superposition between slip on phyllosilicates and brittle (hydrofrac- 
tures) or earthquake-related structures (pseudotachylytes)*®. Sharp 
and continuous slip zones of highly sheared clay-rich fault gouge have 
been interpreted as seismogenic principal displacement zones’’. 
Continuous strands of phyllosilicates usually bound lenses of stronger 
lithologies'*””: these lenses could represent sites for stress concentra- 
tions and earthquake nucleation near patches of fault creep’****". In 
addition, laboratory experiments on gouge mixtures show that 
when the phyllosilicates constitute a small percentage of the fault 
(10-30%), velocity-weakening behaviour is favoured at higher slip 
velocities”. A similar weak and velocity-weakening behaviour has 
been documented for smectite clay at low normal stress** (<30 MPa) 
and for clay-rich fault gouge in high-velocity friction experiments™. In 
this view, some crustal faults can behave as weak structures over long 
timescales (millions of years) and be intermittently seismogenic on 
shorter timescales. 


reduction (see detail in d) and shear localization along the R1, Y and B 
sliding surfaces. The little boxes in a and b represent the areas of c and
d, respectively. The half arrows above a and b show direction of shear. 
a-d are scanning electron microscopy back-scattered electron images. 
e, f, TEM images, showing details of the foliated microstructure (solid 
experiment P2048) that is made of an interconnected network of thin, 
oriented phyllosilicates (smectite and talc; low-contrast portions) 
surrounding tremolite fibres. Tr, tremolite; Tlc, talc. 
We show that fault weakening depends strongly on rock fabric and
the distribution of weak phases within a fault zone. The result is akin to 
findings from studies involving dissolution® and ductile deforma- 
tion’. However, we show that fabric-induced weakening can occur 
in the cataclastic, brittle deformation regime via frictional processes. 
Previous works focused on brittle fault strength have looked primarily 
at homogeneous mineral mixtures and bulk composition of the fault 
zone. Fluid—rock interactions and the growth of interconnected, 
phyllosilicate-rich networks have been documented in a wide variety 
of rock types*!7"'°**”7°. We therefore propose that frictional sliding
along such rock fabrics may be a viable mechanism to explain the 
mechanics of weak faults within the brittle crust. 


METHODS SUMMARY 


To determine the mineralogical composition of the fault rocks (Table 1), we have 
quantitatively combined results from X-ray powder diffraction (XRPD) and 
thermo-gravimetry/mass spectrometry (TG-MS) studies. 

For friction experiments we collected large blocks of foliated fault rocks per- 
taining to L2 and L3 domains and we cut them to form wafers 0.8-1.2 cm thick 
and 5 cm × 5 cm in area. The wafers were oriented so that they could be sheared
in their in situ orientation, with foliation parallel to shear direction. We refer to 
these as solid experiments. We also carried out experiments on powders obtained 
from crushing and sieving the solid wafers. Experiments were conducted in a 
servo-controlled biaxial deformation apparatus'®”° (Fig. 2a inset), at room tem-
perature and over a range of shear slip velocities from 1 to 300 μm s⁻¹. Data
reported in Figs 2 and 3 refer to a sliding velocity of 10 μm s⁻¹.


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
METHODS


We chose rock samples from two highly foliated domains of the Zuccale fault (L2 
and L3 in Fig. 1). Mineral abundances in selected fault rocks have been deter- 
mined by coupling XRPD and TG-MS. For each sample, we collected three 
different XRPD patterns from: 

(1) powders (grain size <150 μm), representative of the bulk sample mineralogy
(massive carbonate fragments + foliation); 

(2) fine fraction (grain size <2 μm, oriented samples), representative of the fine-
grained foliation;

(3) fine fractions after glycolation, to distinguish expandable versus not-expandable 
clays. 

The XRPD patterns on fine fractions revealed smectite, tremolite and talc as 
the main minerals within the foliation (Supplementary Fig. 1). The relative 
amounts of smectite, talc and tremolite, as reported in Table 1, were determined 
on the basis of XRPD intensity ratios. The total amount of calcite in the bulk 
samples was determined by TG-MS. The TG-MS data indicated that calcite 
decomposition occurred in the temperature range 600-800 °C. From the weight 
losses registered in this temperature range, we determined total calcite contents 
of 43% and 39% in L2 and L3, respectively. 

For the experiments we collected blocks of cohesive foliated fault rocks per- 
taining to L2 and L3 domains and we cut them to form wafers 0.8-1.2 cm thick 
and 5 cm × 5 cm in area. The wafers were oriented so that they could be sheared
in their in situ orientation, with foliation parallel to shear direction. We refer to
these as solid experiments. We also carried out experiments on powders obtained
from crushing and sieving (<150 μm): (1) intact pieces of fault rock and (2) the
samples used in the solid experiments. With this approach we directly compare 
the frictional properties of foliated fault rocks and their powdered equivalents, 
which have identical mineralogical compositions (Table 1). We found no dif- 
ference between experiments conducted on layers of powdered fault rock and 
powders from solid experiments. 

Experiments were conducted in a double direct shear geometry using a servo- 
controlled biaxial deformation apparatus'®”° (Fig. 2a inset). Three steel blocks 
sandwiched two layers of fault rock while the horizontal hydraulic ram, in load 
feedback mode, applied normal load to the layers. The vertical ram was advanced 
in displacement feedback mode at constant velocity, thereby shearing the two 
fault rock layers. Both normal and shear loads were measured by load cells at the 
load point with a precision of 0.1 kN. Displacement of the horizontal and vertical
axes was measured using displacement transducers with 0.1 μm precision. The
sheared layers were initially 0.8-1.2 cm thick (solid fault rock) or 0.6 cm thick 
(powders) and we measured changes in layer thickness continuously during 
shear. Frictional contact area is constant during shear. Loads and displacements 
were continuously recorded for each axis at 10 kHz and averaged to rates from 1 
to 100 Hz, depending on the sliding velocity. Normal and shear stresses were 
obtained by dividing the measured applied loads by the contact area of the 
forcing block (5 cm × 5 cm) in the case of normal stress, and twice the contact area
in the case of shear stress. 
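
As a minimal sketch of this stress calculation: the load values below are hypothetical and chosen only for illustration, the function name is not taken from the apparatus software, and taking the friction coefficient as the ratio of shear to normal stress assumes negligible cohesion.

```python
# Nominal contact area of the forcing block: 5 cm x 5 cm, in square metres.
CONTACT_AREA_M2 = 0.05 * 0.05


def double_direct_shear_stresses(normal_load_kn, shear_load_kn,
                                 area_m2=CONTACT_AREA_M2):
    """Convert measured loads (kN) to stresses (MPa) and a friction coefficient.

    Normal stress uses one contact area; shear stress uses twice the contact
    area because two layers are sheared at once in the double direct shear
    geometry.
    """
    normal_stress_mpa = normal_load_kn * 1e3 / area_m2 / 1e6
    shear_stress_mpa = shear_load_kn * 1e3 / (2.0 * area_m2) / 1e6
    friction = shear_stress_mpa / normal_stress_mpa  # assumes zero cohesion
    return normal_stress_mpa, shear_stress_mpa, friction


# Hypothetical loads: 125 kN normal load and 62.5 kN shear load.
sigma_n, tau, mu = double_direct_shear_stresses(125.0, 62.5)
print(f"normal stress {sigma_n:.1f} MPa, shear stress {tau:.1f} MPa, mu {mu:.2f}")
```

With these illustrative loads the normal stress is 50 MPa, the shear stress is 12.5 MPa and the friction coefficient is 0.25.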
Common ecology quantifies human insurgency


Juan Camilo Bohorquez, Sean Gourley, Alexander R. Dixon, Michael Spagat & Neil F. Johnson


Many collective human activities, including violence, have been 
shown to exhibit universal patterns’"’. The size distributions of 
casualties both in whole wars from 1816 to 1980 and terrorist attacks 
have separately been shown to follow approximate power-law dis- 
tributions®””"°. However, the possibility of universal patterns 
ranging across wars in the size distribution or timing of within- 
conflict events has barely been explored. Here we show that the sizes 
and timing of violent events within different insurgent conflicts 
exhibit remarkable similarities. We propose a unified model of 
human insurgency that reproduces these commonalities, and 
explains conflict-specific variations quantitatively in terms of under- 
lying rules of engagement. Our model treats each insurgent popu- 
lation as an ecology of dynamically evolving, self-organized groups 
following common decision-making processes. Our model is con- 
sistent with several recent hypotheses about modern insurgency'*”, 
is robust to many generalizations”, and establishes a quantitative 
connection between human insurgency, global terrorism’® and eco- 
logy'*'”**”*, Its similarity to financial market models*” provides a 
surprising link between violent and non-violent forms of human 
behaviour. 

The political scientist Spirling’? and others”'® have correctly 
warned that finding common statistical distributions (for example, 
power laws) in sociological data is not the same as understanding 
their origin. Possible political, ideological, cultural, historical and 
geographical influences make conflict arguably one of the ‘messiest’
of all human activities to analyse. Mindful of these challenges, yet 
inspired by recent studies of human dynamics'"'''””*, we analyse the 
size and timing of 54,679 violent events reported within nine diverse 
insurgent conflicts, placing equal emphasis on both finding and 
modelling common patterns. Such insurgencies typify the future 
wars and threats faced by society’®”’. 

Our data sources are real-time media databases, official (govern- 
ment and non-governmental organization) reports and academic 
studies. Supplementary Information provides details, plus data-set 
extracts. The event data from different conflicts were compiled by 
different researchers, often with cross-checking by independent 
research teams, thereby reducing systematic collection or record-
ing biases. Comparison of event accounts across a wide range of
sources'*”° reduces potential media bias and mistaken aggregation 
(for example, misreporting two events of sizes x₁ and x₂ as one event
of size x₃ = x₁ + x₂), which would create significant errors in a tail-
dependent estimate such as a power-law slope. We focus on measuring
deaths, because injuries are harder to cross-check. However, where 
possible, we check that our conclusions are robust to the inclusion of 
injuries. 

Figures 1 and 2 show our empirical findings for event size, whereas 
Fig. 3 shows event timings. Our model (described later and shown 
schematically in Fig. 4) provides a quantitative explanation of these 
findings by treating the insurgent population as an ecology of 
dynamically evolving, decision-making groups, in line with several 
recent sociological hypotheses'* *°. In addition to explaining the ubi- 
quity of approximate power laws in the event size distribution and 
the apparent central role of the 2.5 exponent value (Fig. 1), it explains 
the conflict-dependent deviations beyond a power law (see green 
curves in Fig. 2). Furthermore, the same model framework also 
explains the common burstiness in the distribution of event timings 
that we observe across insurgent conflicts (see black curves in Fig. 3). 

Figure 1 gives exponent values, obtained by applying Clauset et 
al.’s”'° established methodology for estimating discrete power-law 
distributions, p(x) ∝ x^(-α) for x ≥ x_min, where x_min is estimated
together with α. In all cases we cannot reject the hypothesis that
the size distribution of the events follows a power law, but we can 
reject log-normality. Four detailed examples are shown in Fig. 2. 
Following our preliminary 2005 results for Iraq and Colombia, we 
had suggested’” that other insurgent wars might be clustered around 
α = 2.5. All the insurgent wars that we have analysed support this
hypothesis. By contrast, we find that the Spanish Civil War and the
American Civil War, neither of which is considered insurgent,
each give distributions where log-normal cannot be rejected, and
where even the best-fit α value is much smaller (near 1.7, which is the
value for the aggregated sizes of conventional wars’). This finding pro-
vides quantitative support for claims circulating in social science’*””
Figure 1 | Power-law exponents. Value α for power law p(x) ~ x^(-α) deduced
from the empirical distributions of event size x (that is, the number of 
casualties) for insurgent conflicts. Statistical procedures follow refs 9 and 10. 


Blue dot shows the value 2.48 for distribution of total size of global terrorist 
events, from Clauset et al.’°. The years in parentheses describe the empirical 
data set range used to deduce α, not the actual conflict duration.
Figure 2 | Size of events. a-d, Log-log plot of the complementary
cumulative distribution of event size P(X ≥ x) (that is, the probability of an
event of size greater than or equal to x) for four conflicts from Fig. 1.


that insurgent wars represent ‘open-source’'’, ‘fourth-generation’”” 
warfare, with qualitatively different dynamics from traditional wars. 
Several trivial explanations of the data can be ruled out, such as pro- 
portionality to city size’. 
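
To make the exponent estimation concrete, the following is a minimal sketch of a maximum-likelihood estimate of α for integer event sizes. It uses the approximate closed-form estimator for discrete data, α ≈ 1 + n / Σ ln(x_i / (x_min − 1/2)), and takes x_min as given; estimating x_min jointly with α and testing goodness of fit, as in the full methodology, are not reproduced here. The synthetic data generator and all function names are assumptions made for illustration and are not the conflict data.

```python
import math
import random


def estimate_alpha(sizes, x_min):
    """Approximate MLE of the power-law exponent for integer sizes >= x_min."""
    tail = [x for x in sizes if x >= x_min]
    n = len(tail)
    log_sum = sum(math.log(x / (x_min - 0.5)) for x in tail)
    alpha = 1.0 + n / log_sum
    std_error = (alpha - 1.0) / math.sqrt(n)  # leading-order standard error
    return alpha, std_error, n


def synthetic_sizes(alpha, x_min, n, seed=0):
    """Draw integer 'event sizes' by rounding continuous power-law samples."""
    rng = random.Random(seed)
    lower = x_min - 0.5
    exponent = -1.0 / (alpha - 1.0)
    # Inverse-transform sampling, then rounding to the nearest integer.
    return [int(lower * (1.0 - rng.random()) ** exponent + 0.5)
            for _ in range(n)]


sizes = synthetic_sizes(alpha=2.5, x_min=10, n=20000)
alpha_hat, err, n_tail = estimate_alpha(sizes, x_min=10)
print(f"alpha = {alpha_hat:.2f} +/- {err:.2f} from {n_tail} events")
```

The approximation is best when x_min is not too small; for very small x_min an exact discrete likelihood would be the safer choice.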

Figure 3 demonstrates a common burstiness in the distribution for the 
number of events per day, n, irrespective of size. As explained in the 
Methods and Supplementary Information, we compare the distribu- 
tions over daily event counts for different epochs within the four modern 
conflicts for which we have such data, against control distributions 
green curves show the results from our model. Blue dashed line is a straight
(‘random war’) obtained by randomizing event occurrences within each
epoch. The data for each conflict (green circles) deviate from its random 
war (dashed curve) in a similar way: the real war exhibits an over- 
abundance of light days (that is, days with few attacks) and of heavy 
days (that is, days with many attacks), but a lack of medium days
compared with the random war (see lower panel). By considering 
subsets of days, we have determined that these features are not just an 
artefact of a variation in attack volume across days of the week (for 
example, Fridays; see Supplementary Information). Interestingly, this 
burstiness has become more pronounced over time for the wars in both
Iraq and Colombia, suggesting that they have become less random as 
they have evolved. These findings are insensitive to the precise specifica- 
tion of the epochs within a given conflict. 
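
The comparison with a random war can be sketched as follows. This is an illustration only: it assumes that randomizing means reassigning every event to a uniformly chosen day within the epoch, and the toy event list is invented rather than drawn from any of the conflict data sets. Function names are hypothetical.

```python
import random
from collections import Counter


def daily_count_distribution(event_days, n_days):
    """Map n (events per day) to the number of days in the epoch with n events."""
    per_day = Counter(event_days)
    return dict(Counter(per_day.get(day, 0) for day in range(n_days)))


def random_war(event_days, n_days, rng):
    """Control series: each event is moved to a uniformly random day."""
    return [rng.randrange(n_days) for _ in event_days]


# Toy 30-day epoch with events clustered on a few heavy days (invented data).
n_days = 30
event_days = [0] * 6 + [3] + [7] * 5 + [20] * 4

rng = random.Random(1)
real = daily_count_distribution(event_days, n_days)
control = daily_count_distribution(random_war(event_days, n_days, rng), n_days)
print("real war:  ", sorted(real.items()))
print("random war:", sorted(control.items()))
```

In practice the control distribution would be averaged over many randomizations before comparing the numbers of light, medium and heavy days.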

Our model framework in Fig. 4 incorporates two key features: (1) 
ongoing group dynamics within the insurgent population (for 
example, as a result of internal interactions and/or the presence of 
an opposing entity such as a state army); (2) group decision-making 
about when to attack based on competition for media attention. 
Within this framework, we find that mechanism (1) dictates the 
features observed in Figs 1 and 2 whereas mechanism (2) dictates 
those in Fig. 3. Mechanism (1) is consistent with recent work on 
human group dynamics in everyday environments”, and with cur- 
rent views of modern insurgencies as fragmented, transient and evol- 
ving'*. Mechanism (2) is consistent with comments by former US 
Senior Counterinsurgency Adviser David Kilcullen, who noted” that 
when insurgents ambush an American convoy in Iraq, “... they’re not 
doing that because they want to reduce the number of Humvees we 
have in Iraq by one. They’re doing it because they want spectacular 
media footage of a burning Humvee.” We consider the insurgent
population as having an overall strength N comprising human com- 
batants, information, resources and weapons—though, for simpli- 
city, one can think of N humans. N is continually being re- 
partitioned through coalescence and fragmentation processes, 
thereby producing an ecology of groups. A group’s strength at 
time-step t determines the number of human casualties x it would
produce if it decided to engage in an event at that time-step. We take 
N to be approximately constant over time, though our main conclu- 
sions are unchanged if N evolves slowly with small fluctuations. 
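
A coalescence-fragmentation process of this kind can be sketched in a few lines. The version below is a simplified illustration rather than the full model: it assumes that a group is selected with probability proportional to its strength, that a fragmenting group scatters completely into single units, and that fixed probabilities govern fragmentation and coalescence. The parameter names and default values are assumptions made for the example.

```python
import random


def simulate_groups(n_total, steps, nu_frag=0.01, nu_coal=0.99, seed=0):
    """Repartition a fixed total strength n_total among groups over time."""
    rng = random.Random(seed)
    groups = [1] * n_total  # start from isolated units
    for _ in range(steps):
        # Pick a group with probability proportional to its strength.
        i = rng.choices(range(len(groups)), weights=groups)[0]
        if rng.random() < nu_frag:
            # Fragmentation: the whole group scatters into single units.
            size = groups.pop(i)
            groups.extend([1] * size)
        elif rng.random() < nu_coal and len(groups) > 1:
            # Coalescence: merge with a second strength-weighted group.
            j = rng.choices(range(len(groups)), weights=groups)[0]
            if j != i:
                merged = groups[i] + groups[j]
                for k in sorted((i, j), reverse=True):
                    groups.pop(k)
                groups.append(merged)
    return groups


groups = simulate_groups(n_total=200, steps=2000)
print(f"{len(groups)} groups, largest strength {max(groups)}")
```

The total strength is conserved at every step, so the list of group strengths can be sampled over time to build the distribution of potential event sizes.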

Figure 4 | Model framework for insurgency. The insurgent population
comprises an overall strength N, distributed into groups with diverse
strengths at each time-step t. This distribution changes over time as groups
join and break up. Dark shadows indicate strength, and hence casualties that [...]

These two coexisting dynamic mechanisms generate rich time series
that can explain the numbers of events of different sizes at each time-
step. However, because the data in Figs 1 and 2 are time-aggregated
whereas those in Fig. 3 are size-aggregated, we can provide far more
insightful explanations by using simplified versions that treat the
respective non-dominant mechanism in an averaged way. Consider
first the simple situation in which the group coalescence and frag-
mentation processes in the insurgent population are represented by
probabilities. The fragmentation probability ν_frag is taken to be small
(~1%) to mimic the infrequent situation in which a group member
suddenly senses imminent danger and the entire group scatters. If 
fragmentation does not occur, the group may coalesce with another 
group with probability ν_coal. This mimics the situation in which two
individuals initiate a communications link between them of arbitrary
range (for example, a mobile phone call), and hence their respective
groups of strength s_1 and s_2 act in a coordinated way with strength
(s_1 + s_2). Because these two processes can be triggered by any particu-
lar constituent group member at any time, the probability that it
affects a specific group should be proportional to s (refs 24-26). 
Treating mechanism (2) in an averaged way, we assume that all groups 
are equally likely to be involved in an event over time. This is consis- 
tent with the time-averaged behaviour of the full decision-making 
model (see later). The time-averaged distribution of group strengths 
s therefore acts like the distribution of event sizes x (ref. 12), resulting 
in a steady-state approximate power-law distribution whose analytic 
solution α = 2.5 (refs 25, 26) is within the empirical bounds of Clauset
et al.’s total value of 2.48 ± 0.07 for global terrorism. This analy-
tically obtained theoretical value of 2.5 is robust to many model
generalizations (for example, coalescence of multiple groups,
fragmentation into groups larger than one), thereby offering an 
explanation for the observed bunching of the empirical values around 
2.5 in Fig. 1. 
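
The coalescence and fragmentation process described above can be sketched in a few lines of Python. This is a minimal toy version under stated assumptions, not the authors' implementation: the names `simulate_groups` and `strength_histogram`, the default values of `n_total`, `nu_coal` and `steps`, and the choice to shatter a fragmenting group into units of strength 1 are all illustrative. Only the small fragmentation probability (about 1%) and the selection of groups in proportion to their strength s are taken from the text.

```python
import random
from collections import Counter


def simulate_groups(n_total=500, nu_frag=0.01, nu_coal=0.99, steps=20_000, seed=0):
    """Toy coalescence-fragmentation process with fixed total strength n_total.

    At each step one group is picked with probability proportional to its
    strength s. With probability nu_frag it shatters into s groups of
    strength 1. Otherwise, with probability nu_coal, it merges with a second
    group that is also picked in proportion to strength.
    All default parameter values are assumptions made for illustration.
    """
    rng = random.Random(seed)
    groups = [1] * n_total  # start from a fully fragmented population
    for _ in range(steps):
        i = rng.choices(range(len(groups)), weights=groups)[0]
        if rng.random() < nu_frag:
            s = groups.pop(i)
            groups.extend([1] * s)  # the entire group scatters
        elif rng.random() < nu_coal and len(groups) > 1:
            j = rng.choices(range(len(groups)), weights=groups)[0]
            if j != i:
                groups[i] += groups[j]  # the two groups now act as one
                groups.pop(j)
    return groups


def strength_histogram(groups):
    """Map each group strength s to the number of groups with that strength."""
    return Counter(groups)
```

Plotting `strength_histogram(simulate_groups())` on log-log axes, ideally averaged over many time-steps, gives a distribution that can be compared with the approximate power law discussed above.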

Invoking a more realistic mechanism for grouping dynamics than 
simple probabilities (see Supplementary Information), we find that 
our model framework can explain not only the approximate power- 
law behaviour and central role of the 2.5 exponent (Fig. 1) but also 
the behaviours beyond power law observed in Fig. 2. Accounting 
explicitly for an opposing population (for example, state army) with 
total strength N_B, the coalescence and fragmentation are now caused
by the interactions between groups. The casualties produced by 
clashes between opposing groups can then be used to obtain the event 
size distributions (green curves in Fig. 2). Full details are given in the 
Methods and Supplementary Information, with the four model para-
meters for each conflict (total insurgent strength N_A, total state
strength N_B and casualty scales C_S and C_L). When two opposing
groups meet they fight, with some members of both groups killed
and the smaller group fragmenting. C_S (C_L) sets the scale for the
number of the smaller (larger) group’s members destroyed. As C_S is
increased, the model deviates increasingly from a straight line at low 
x, suggesting that Afghanistan and Colombia share the following 
similarity: in a clash in which the insurgent group is the smaller 
group, this insurgent group takes heavier relative losses than for 
the wars in Iraq and Peru. The ratio between the two populations’ 
strengths (N_A and N_B) tends to control the slope itself, with greater
strength differences resulting in steeper slopes. This suggests that
there might be a greater difference between the strengths of the army
and insurgency in Colombia than in Iraq or Afghanistan. The total
insurgent strength N_A controls the large x roll-off in Fig. 2.
Afghanistan and Peru deviate substantially from power laws for large 
x, which our model interprets as relatively small insurgency strength. 
Colombia and Iraq hardly deviate from power laws for large x, imply- 
ing greater insurgency strength. 

Because Fig. 3 features data aggregated over size, we replace the 
detailed grouping dynamics (that is, mechanism (1)) by a time-averaged
number of groups. Given the resolution of our data and the typical
numbers of observed daily attacks, we take one time-step as equivalent
to one day. If a group launches an attack during a day with many other
attacks, its media coverage will in general be reduced. If, instead, it
launches an attack on a quiet day, its media coverage will increase.
Each group receives daily some common but limited information (for 
example, public radio or newspaper announcements about previous 
attacks, opposition troop movements, a specific religious holiday, even 
a shift in weather patterns). The actual content is unimportant provided 
it becomes the primary input for the group’s decision-making process. 
(See ref. 26 for a full description of an equivalent financial-market 
version.) Although the groups are heterogeneous in terms of their 
strategies, they tend to converge towards similar responses when fed 
the same information, thereby generating distributions (black curves)
that are almost identical to those observed (green circles). Our model 
(see Supplementary Information and ref. 26) includes a confidence 
threshold that must be surpassed before any decision can be made, 
allowing us to interpret the increase in non-randomness over time for 
Iraq and Colombia as a decrease in this confidence threshold; that is, the 
insurgent groups in both wars have become less cautious over time 
about whether to launch attacks. Reference 30 presents independent 
empirical evidence that groups of humans do indeed use such generic 
decision-based mechanisms. 

To our knowledge, our model provides the first unified explana- 
tion of high-frequency, intra-conflict data across human insurgen- 
cies. Other explanations of human insurgency are possible, though 
any competing theory would also need to replicate the results of 
Figs 1-3. Our model’s specific mechanisms challenge traditional 
ideas of insurgency based on rigid hierarchies and networks, whereas 
its striking similarity to multi-agent financial market models
hints at a possible link between collective human dynamics in violent
and non-violent settings.


METHODS SUMMARY 


For the event size distribution (Figs 1 and 2), we use Clauset et al.’s methodo-
logy to estimate power-law exponents, and test power-law and log-normal
hypotheses, with the time-aggregated time series of events. This methodology
is a widely accepted, published state-of-the-art statistical procedure for analysing 
power-law-like distributions. For the event timing distribution (Fig. 3), we 
divide the time series for the number of events per day into epochs. These epochs 
are chosen such that there is no significant trend in the moving-average within 
each epoch. The precise specification of each epoch’s time-window does not 
affect our main findings. We then generate 10,000 random wars by shuffling 
the date of the events within each section, averaging across the shuffles. Our 
model (Fig. 4) replicates the empirical size and timing patterns of Figs 1-3. Full 
details are given in Methods and Supplementary Information. 
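
The shuffling procedure for the event timing distribution can be illustrated with a short Python sketch. It rests on several assumptions: "shuffling the date of the events" is read here as reassigning each event to a uniformly random day of its epoch, and the function names, the default `n_shuffles` and the input format (a list of integer day indices) are hypothetical. The full Methods may define the procedure differently.

```python
import random
from collections import Counter


def events_per_day_distribution(event_days, epoch_days):
    """Return {k: number of days in the epoch on which exactly k events occurred}."""
    per_day = Counter(event_days)
    return Counter(per_day[d] for d in epoch_days)


def shuffled_baseline(event_days, epoch_days, n_shuffles=1000, seed=0):
    """Average the events-per-day distribution over randomized 'wars'.

    Each randomized war keeps the number of events in the epoch fixed but
    assigns every event to a random day (an assumed reading of the text).
    """
    rng = random.Random(seed)
    epoch_days = list(epoch_days)
    totals = Counter()
    for _ in range(n_shuffles):
        random_days = [rng.choice(epoch_days) for _ in event_days]
        totals.update(events_per_day_distribution(random_days, epoch_days))
    return {k: count / n_shuffles for k, count in totals.items()}
```

Comparing the observed distribution with the shuffled baseline, epoch by epoch, shows whether days with many attacks or with none occur more often than expected for randomly timed events.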

Rapid formation and selective stabilization of 
synapses for enduring motor memories 

Tonghui Xu*, Xinzhu Yu*, Andrew J. Perlik, Willie F. Tobin, Jonathan A. Zweig, Kelly Tennant, Theresa Jones & Yi Zuo


Novel motor skills are learned through repetitive practice and, 
once acquired, persist long after training stops. Earlier studies
have shown that such learning induces an increase in the efficacy
of synapses in the primary motor cortex, the persistence of which
is associated with retention of the task. However, how motor
learning affects neuronal circuitry at the level of individual 
synapses and how long-lasting memory is structurally encoded 
in the intact brain remain unknown. Here we show that synaptic 
connections in the living mouse brain rapidly respond to motor- 
skill learning and permanently rewire. Training in a forelimb 
reaching task leads to rapid (within an hour) formation of post- 
synaptic dendritic spines on the output pyramidal neurons in the 
contralateral motor cortex. Although selective elimination of 
spines that existed before training gradually returns the overall 
spine density back to the original level, the new spines induced 
during learning are preferentially stabilized during subsequent 
training and endure long after training stops. Furthermore, we 
show that different motor skills are encoded by different sets of 
synapses. Practice of novel, but not previously learned, tasks 
further promotes dendritic spine formation in adulthood. Our 
findings reveal that rapid, but long-lasting, synaptic reorganiza- 
tion is closely associated with motor learning. The data also sug- 
gest that stabilized neuronal connections are the foundation of 
durable motor memory. 

Fine motor movements require accurate muscle synergies that rely 
on coordinated recruitment of intracortical synapses onto corticospinal 
neurons. Obtaining new motor skills has been shown to strengthen
the horizontal cortical connections in the primary motor cortex. In
this study, we taught mice a single-seed reaching task (Supplementary
Movie 1). The majority of 1-month-old mice that underwent training
gradually increased their reaching success rates during the initial 4 days,
and then levelled off (n = 42, Fig. 1a, b). There were a few mice (n = 5)
that engaged in extensive reaching, but continually failed to grasp the 
seeds. These mice normally gave up reaching after 4-8 days (Fig. 1b). To 
investigate the process of learning-induced synaptic remodelling in the 
intact motor cortex, we repeatedly imaged the same apical dendrites of 
layer V pyramidal neurons marked by the transgenic expression of 
yellow fluorescent protein (YFP-H line) in various cortical regions 
during and after motor learning, using transcranial two-photon micro-
scopy (Supplementary Fig. 1). Dendritic spines are the postsynaptic
sites of most excitatory synapses in the brain and changes in spine
morphology and dynamism serve as good indicators of synaptic
plasticity. Spines that were formed and eliminated were identified
by comparing images from two time points, and then normalized to the
initial images. Imaged regions were guided by stereotaxic measure-
ments, ensuring the imaged neurons resided in the primary motor
cortex. In several experiments, intracortical microstimulation was
performed at the end of repetitive imaging to confirm that images were
taken from the functionally responding motor cortex (Fig. 1c, Sup-
plementary Notes and Supplementary Fig. 2).

Figure 1 | Motor skill learning in adolescent mice promotes immediate
spine formation in the contralateral motor cortex. a, A cartoon of motor
training. b, Average success rates during training for learning and non-
learning mice (mean ± s.e.m., 42 learners and 5 non-learners). c, An
intracortical microstimulation map indicates that the imaged region is
within the motor cortex. Scale bar, 1 mm. d, e, Repeated imaging of the same
dendritic branches over one-day intervals reveals spine elimination (arrows)
and formation (arrowheads), and filopodia (asterisks) in a general control
(d) and a trained (e) mouse. Scale bar, 2 µm. f, Percentage of spines formed
and eliminated under various control and training conditions immediately
following the first training session (mean ± s.d., ***P < 0.001). g, The
degree of spine formation observed following the first training session is
linearly correlated with the number of successful reaches during this session
(r² = 0.77).

Unexpectedly, we found that motor learning led to rapid forma-
tion of dendritic spines (spinogenesis) in the motor cortex contra-
lateral to the reaching forelimb. One-month-old mice that finished
30 reaches with more than 10 successes in the first day of training
were imaged within 1 h of the training session and showed
10.6 ± 1.1% new spines that were not in the images acquired the
day before training. This spine formation was more than double that
found in age-matched controls, which were handled similarly and
imaged over the same period of time, but not trained (Fig. 1d-f,
4.7 ± 0.6% in general controls, P < 0.001). In contrast, spine elimi-
nation measured in the same images was not significantly altered by
motor learning during single training sessions (Fig. 1f, P> 0.9). In 
addition, mice that went through shaping but not training (shaping 
controls) or mice that were trained to reach for a seed too far away to 
grasp (activity controls) did not show an increase in spine formation 
rates (Fig. 1f, P>0.1 with general control, P< 0.001 with trained 
mice; see Methods for all control conditions). This suggests that 
refinement of fine motor movements, rather than other training- 
related experiences or unskilled motor activity, drives robust spine 
formation. Furthermore, the percentage of spines formed immedi- 
ately after the first training session is linearly correlated with the 
number of successful reaches during the training session, revealing a 
direct link between learning and spine formation (Fig. 1g, r² = 0.77).

Perfection of a motor skill often requires persistent practice over 
time. To examine how prolonged learning affects spine dynamics, we 
trained and imaged mice over different periods of time (that is, from 
2 to 16 days). We found that training for 2 days and longer resulted in 
significant increases, not only in spine formation, but also in spine 
elimination (Fig. 2a, b, P<0.005 at all time points). Although 
delayed, this increase in spine elimination ultimately resulted in 
the total spine density in the trained animals returning to control 
levels by day 16 (Fig. 2c). As a control, we measured spine formation 
and elimination over a 4-day training period in the ipsilateral (to the 
trained limb) primary motor cortex and the contralateral posterior 
sensory cortex, and found no significant increase in spine formation 
or elimination in either case (Fig. 2a, b, d, e, P> 0.2). In addition, 
mice that failed to learn also failed to show an increase in either spine 
formation or elimination in the contralateral motor cortex (Figs 1b 
and 2a, b, f, P>0.6). Therefore, the observed changes in spine 
dynamics are region- and learning-specific, indicating that motor 
learning causes synaptic reorganization in the corresponding motor 
cortex. 

The enhanced spine loss after rapid spinogenesis reflects a rewiring 
of the neuronal circuitry in response to learning, rather than a simple 
addition of new spines. To examine how learning reorganizes synaptic 
connections, we imaged the same mice three times, classified imaged 
spines into new and pre-existing spines based on their appearance in 
the initial two images, and then quantified their survival percentages 
in the third images (Fig. 3a). Our data show that new spines are less 
stable than pre-existing spines in general (Fig. 3b, c). Specifically, in 
control mice, 43.8 ± 3.1%, 25.8 ± 5.3% and 19.2 ± 4.6% of the
spines that formed between days 0 and 4 remained by days 6, 8 and
16, respectively. During the same period of time, 96.7 ± 0.5%,
94.9 ± 1.1% and 92.8 ± 1.9% of the pre-existing spines remained
(Fig. 3d, P< 0.001 compared to new spines). These results suggest 
that new spines are initially unstable and undergo a prolonged selec- 
tion process before being converted into stable synapses. In addition, 
we found that new spines were significantly more stable in trained 
mice, with 64.1 ± 2.2%, 55.3 ± 4.1% and 51.0 ± 4.8% of the spines
that formed during the initial 4-day training remaining by days 6, 8 
and 16, respectively (Fig. 3d, P< 0.001 compared to new spines in 
control mice). In contrast, pre-existing spines in trained mice were 
significantly less stable than control mice over the same time periods 
(Fig. 3d, P < 0.05). More importantly, when the fate of the new
spines formed during initial learning (days 0-4) was examined months
later (day 120), we found that 42.3 ± 2.9% of new spines persisted
in the mice trained for 16 days during adolescence, whereas only
13.5 ± 1.7% of new spines remained in the control mice (Fig. 3d,
P < 0.001). In addition, we found that spine formation and stabiliza-
tion were associated with behavioural improvement. More new spines
were formed daily during the learning acquisition phase (days 1-4)
than during the learning maintenance phase (days 5-16); the new
spines that were formed during learning acquisition, but not during
maintenance, were preferentially stabilized with continuous training
(Supplementary Notes and Supplementary Fig. 3). Taken together,
these data indicate that motor learning selectively stabilizes learning-
induced new spines and destabilizes pre-existing spines. The pro-
longed persistence of learning-induced synapses provides a potential
cellular mechanism for the consolidation of lasting, presumably per-
manent, motor memories.

Figure 2 | Enhanced spine dynamics during adolescent motor training is
region- and learning-specific. a, b, Percentage of spines formed (a) and
eliminated (b) under control and training conditions. c, Total spine number
increases during initial learning, but returns to normal levels with prolonged
training. d, e, Imaging of the same dendritic branches over 4 days in the
ipsilateral primary motor cortex (d) and the contralateral sensory cortex
(e) of the trained mice. f, Imaging of the same dendritic branches over 4 days
in the contralateral motor cortex of a mouse that failed to learn the task. Data
are presented as mean ± s.d., *P < 0.05, **P < 0.01, ***P < 0.001. Scale bar,
2 µm.
[Figure 2 panel legend: Control; Training (contralateral forelimb motor cortex);
Training (+500 µm outside contralateral forelimb motor cortex); Training
(ipsilateral motor cortex); Training (contralateral sensory cortex); Fail to learn
(contralateral motor cortex). x-axes: Imaging intervals (days) in a and b; Days in c.]

Figure 3 | [...] of experiments, showing possible outcomes. b, c, Repeated imaging of
dendritic branches at 0, 4 and 120 days in a control (b) and a trained
(c) mouse. Scale bar, 2 µm. d, Percentages of surviving new and pre-existing
spines, as a function of time, for control and trained animals (mean ± s.d.,
*P < 0.05 and ***P < 0.001). Numbers of animals examined at each time
point are indicated below new spine data points.
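
The three-session survival analysis described above (classify spines as new or pre-existing from the first two images, then score their presence in the third) can be written as a small set computation. The sketch below is illustrative only: it assumes each spine has already been given a stable identifier across imaging sessions, and the function name `survival_percentages` and its input format are hypothetical.

```python
def survival_percentages(first, second, third):
    """Survival of new and pre-existing spines across three imaging sessions.

    first, second, third: sets of spine identifiers seen in each session
    (assumed to be consistent across sessions for the same dendrite).
    A spine is 'new' if it appears in the second image but not the first,
    and 'pre-existing' if it appears in both of the first two images.
    """
    new = second - first
    pre_existing = first & second

    def percent_surviving(group):
        if not group:
            return float("nan")  # no spines in this class to follow
        return 100.0 * len(group & third) / len(group)

    return {
        "new": percent_surviving(new),
        "pre_existing": percent_surviving(pre_existing),
    }
```

For example, with `first = {1, 2, 3, 4}`, `second = {2, 3, 4, 5, 6}` and `third = {2, 3, 5, 7}`, one of the two new spines and two of the three pre-existing spines survive to the third session.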

Dendrites in the mammalian brain contain not only spines but also 
filopodia. Filopodia are long, thin protrusions without bulbous heads, 
and make up ~10% of the total dendritic protrusions in the motor 
cortex of 1-month-old mice. Previous studies suggest that filopodia
are precursors of dendritic spines. We found that filopodia were
very dynamic in the mouse motor cortex in vivo. Most of them turned
over within 1 day in control mice (79.3 ± 12.8% formation and
87.6 ± 5.9% elimination), and motor learning had no effect on filo-
podial formation and elimination (91.0 ± 15.3% formation and
86.5 ± 8.8% elimination, P > 0.2). Among the filopodia observed in
the initial images, few became spines over the following day in
control mice (6.3%). However, this filopodium-to-spine transition
was enhanced by motor skill learning (13.1%). Furthermore, 25%
of new spines formed from filopodia on training day 1 persisted after
another 4 days of training, indicating a contribution of filopodia to the
rewired neuronal circuitry. In addition, when filopodia and spines
were pooled together for analysis, there was a ~10% increase in the
dynamics of both control and training categories. Thus, the conclu-
sions about motor learning drawn from total protrusions were consistent
with the spine analysis alone (Supplementary Fig. 4).

One of the important characteristics of motor learning is that, once 
the skill is well learned, its further maintenance does not require 
constant practice. To test whether lasting motor memories might 
be contained within structurally stable neural circuits, we trained 
young mice for 8-16 days to acquire the reaching skill, housed them 
in control cage conditions for 4 months, and retrained them on the 
same task in adulthood. We found that these pre-trained mice main- 
tained skilful performance with high success rates even on the first 
day of reintroducing the reaching task (Fig. 4a). Imaging of these pre- 
trained adult mice showed that spine formation and elimination 
during retraining were similar to those of naive adults without train- 
ing (Fig. 4b, e, g, P>0.1 for 4 and 8 days). In contrast, naive adults 
learning the reaching task for the first time had a learning curve 
similar to adolescent mice, and showed significantly higher spine 
formation and elimination than control adults (Fig. 4a-c, g, 4 and
8 days, P< 0.01 with control for both formation and elimination). 
Next, we asked if learning a novel motor skill continued to drive 
synaptic reorganization in the pre-trained brain. To do this, we 
trained mice that had been pre-trained on the reaching task with a 
new motor task—the capellini handling task, which also requires fine 
forelimb motor skills (see Methods). We found that pre-trained mice, 
similar to naive adults, had enhanced spine formation and elimination 
during the training of this novel skill task (Fig. 4d, f, g, P< 0.001 
compared to control adults). Despite high spine dynamics induced 
by novel skill learning, most spines that were formed during adoles- 
cent learning of the reaching task and maintained in adults persisted 
after training with the capellini handling task (95.6 ± 7.7%), suggest-
ing that already stabilized synapses are not perturbed by novel learning
in adults. These results indicate that synaptic structural coding out- 
lasts the early learning experience and persists in adulthood to support 
later maintenance of motor skills. The fact that novel learning experi- 
ences continue to drive synaptic reorganization without affecting the 
stability of synapses formed during previous learning further suggests 
that different motor behaviours are stored using different sets of 
synapses in the brain. 

Our study investigated the process of synapse reorganization in the 
living brain during natural learning, distinguishing it from several 
studies where changes were triggered by non-physiological sensory 
manipulation. Although rapid synapse formation has been
observed during long-term potentiation in vitro, we show, for the
first time, that synapse formation in the neocortex begins immediately
as animals learn a new task in the living brain (within 1 h of training
initiation). Such high spine formation does not occur with motor 
activity alone or later practice of the established skill. The rapidity of 
the response contradicts the general assumption that significant syn-
aptic structural remodelling in motor cortex takes days to occur, fol-
lowing more subtle cellular activity and changes in synaptic
efficacy. One recent study on brain slices shows that glutamate-
sensitive currents expressed in newly formed spines are indistinguish-
able from those in mature spines of comparable volumes, further suggesting
that the new spines formed during learning are probably active.
Furthermore, the persistence of new spines over months provides a
long-lasting structural basis for the enhanced synaptic strength that is
retained even when the task performance is discontinued.

Figure 4 | Novel motor skill training promotes spine formation and
elimination in adult mice. a, Pre-trained mice start with high success rates
during adult retraining (mean ± s.e.m., 10 naive trained and 14 retrained
adults). b-f, Repetitive imaging of dendritic branches over 4 days in a control
adult (b), naive adults training with the reaching task (c) and capellini
handling task (d), and pre-trained adults retraining with the same reaching
task (e) and the new capellini handling task (f). Scale bar, 2 µm. g, Percentages of
spines formed and eliminated over 4 days in adult mice under different
conditions (mean ± s.d., ***P < 0.001).

Many previous studies have used fixed tissue preparation to 
investigate changes in synapse number and dendritic complexity 
after motor skill learning. Our in vivo imaging of superficial
dendrites from layer V pyramidal neurons revealed that postsynaptic 
dendritic spine addition was rapid, but eventually counteracted by 
the loss of pre-existing spines, resulting in a time-dependent spine 
density change during motor learning. Although the synaptogenesis 
observed in our study is compatible with earlier results, its temporal 
relationship with behavioural improvement and the contribution of 
synapse elimination in circuitry reorganization in other brain layers 
and regions during motor learning require further investigation. This 
eventual balancing of synapse number could be a homeostatic mech- 
anism by which the output layer V neurons integrate converging 
inputs into superficial cortical layers to govern precise fine motor 
control. 

METHODS SUMMARY 


Young (1 month old) and adult (>4 months old) mice expressing YFP in a small 
subset of cortical neurons (YFP-H line) were used in all the experiments. Young
mice were trained on the single-seed reaching task for up to 16 days and dis- 
played a stereotypical learning curve (Fig. 1b). Naive adult mice and mice that 
had been previously trained with the single-seed reaching task in adolescence 
were trained with either the same reaching task or a novel capellini handling task 
for up to 8 days (see Methods). Apical dendrites of layer V pyramidal neurons, 
10-100 µm below the cortical surface, were repeatedly imaged in mice under
ketamine-xylazine anaesthesia with two-photon laser scanning microscopy. 
Spine dynamics in the motor cortex and other regions were followed over 
various intervals. Imaged regions were initially guided by stereotaxic measure- 
ments. In 14 mice, intracortical microstimulation (see Methods) was performed 
at the end of repetitive imaging to determine the location of acquired images 
relative to the functional forelimb motor map (Supplementary Fig. 2). In total, 
32,079 spines from 209 mice were tracked over 2-4 imaging sessions, with 121
mice imaged twice, 79 mice three times and 9 mice imaged four times. Spine 
formation and elimination rates in each mouse were determined by comparing 
images of the same dendrites acquired at two time points; all changes were 
expressed relative to the total number of spines seen in the initial images. The 
number of spines analysed and the percentage of spine elimination and forma- 
tion under various experimental conditions are summarized in Supplementary 
Table 1. To quantify spine size, calibrated spine head diameters were measured 
over time (Supplementary Notes). All data are presented as mean ± s.d., unless
otherwise stated. P-values were calculated using the Student’s t-test. A non-
parametric Mann-Whitney U-test was used to confirm all conclusions.
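
As a worked illustration of the quantification described above, the following Python sketch computes spine formation and elimination for one mouse, relative to the number of spines in the initial image, and then summarizes a group of mice as mean ± s.d. It is a sketch under assumptions: spines are represented as sets of hypothetical identifiers matched across the two images, and the function names `spine_turnover` and `mean_sd` are not from the paper.

```python
from statistics import mean, stdev


def spine_turnover(initial, later):
    """Return (formation %, elimination %) between two imaging sessions.

    initial, later: sets of spine identifiers seen at the two time points.
    Both percentages are expressed relative to the number of spines in the
    initial image, as stated in the text.
    """
    if not initial:
        raise ValueError("the initial image must contain at least one spine")
    n_initial = len(initial)
    formation = 100.0 * len(later - initial) / n_initial
    elimination = 100.0 * len(initial - later) / n_initial
    return formation, elimination


def mean_sd(values):
    """Summarize per-mouse percentages as (mean, sample standard deviation)."""
    return mean(values), stdev(values)
```

With 100 spines in the initial image, 5 of them missing and 10 new ones present at the second time point, `spine_turnover` returns 10% formation and 5% elimination.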


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 

METHODS 
Single-seed reaching task. Mice were food-restricted to maintain 90% of free 
feeding weight before the start of training. The training chamber was constructed 
as a clear Plexiglas box 20 cm tall, 15 cm deep and 8.5 cm wide into which each
individual mouse was placed. Three vertical slits 0.5 cm wide and 13 cm high were 
located on the front wall of the box: in the centre, on the left side, and on the right 
side (Supplementary Fig. 5). A 1.25-cm-tall exterior shelf was affixed to the wall in 
front of the slits to hold millet seeds for food reward. The training included two 
phases: ‘shaping’ and ‘training’. The shaping phase (2-5 days in duration) was used
to familiarize mice with the training chamber and task requirements and also to 
determine their preferred limbs. During the shaping phase millet seeds were placed 
in front of the centre slit and mice used both paws to reach for them. Shaping was 
considered finished when 20 reach attempts were achieved within 20 min, and the 
mouse showed >70% limb preference. Training started the day after shaping, and 
each training day consisted of one session of 30 trials with preferred limb or 20 min 
(whichever occurred first). Seeds were presented individually in front of the slit on 
the side of preferred limb. Occasionally a mouse used the non-preferred limb; 
however, because of the difficulties presented by reaching angle, such reaches 
usually were unsuccessful. Mice displayed three reach attempt types: fail, drop 
and success (Supplementary Movie 1). A ‘fail’ was scored as a reach in which the 
mouse failed to touch the seed or knocked it away. A ‘drop’ was a reach in which the 
mouse retrieved the seed, but dropped it before putting into its mouth. A ‘success’ 
was a reach in which the mouse successfully retrieved the seed and put it into its 
mouth. Success rates were calculated as the percentage of successful reaches over 
total reach attempts. About half of the mice in our experiments were right handed 
(55 right handed out of a total of 109 mice, 50.6%). All data collected from both 
left- and right-handed mice were pooled for analysis in this study. No significant 
difference was found in the reaching performance of left- and right-handed mice. 
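
The scoring scheme above reduces to a simple calculation, sketched here in Python. The function name `success_rate` and the representation of a session as a list of outcome labels are assumptions made for illustration; only the three outcome types and the definition of the success rate come from the text.

```python
from collections import Counter

VALID_OUTCOMES = {"fail", "drop", "success"}


def success_rate(outcomes):
    """Percentage of successful reaches over total reach attempts.

    outcomes: iterable of labels, each one of 'fail', 'drop' or 'success'.
    """
    counts = Counter(outcomes)
    unknown = set(counts) - VALID_OUTCOMES
    if unknown:
        raise ValueError(f"unknown outcome labels: {sorted(unknown)}")
    total = sum(counts.values())
    if total == 0:
        raise ValueError("no reach attempts recorded")
    return 100.0 * counts["success"] / total
```

A session of 30 reaches with 12 successes, 3 drops and 15 fails therefore has a success rate of 40%.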
All our control mice were littermates that underwent the same food restriction. 
All mice were handled (that is, removed from their cages and placed temporarily 
in the training chamber into which some seeds were dropped) by the same 
experimenters. To ensure that the increase seen in spine dynamics was learning 
specific, three different controls were used in our study. The first control group 
was general controls comprising mice with neither training nor shaping, but with 
food restriction, food reward and handling. The second was shaping controls in 
which mice received similar shaping as trained mice. During training, they were 
placed into the training chamber for 20 min daily, with ~15 seeds periodically 
dropped into the training chamber. This control group was used to determine 
whether the shaping period and/or experience of the training environment had 
any effect on spine dynamics. The third control group was activity controls in 
which mice were given similar shaping as trained mice. During training, mice were 
placed into the training chamber and trained to reach for a seed placed outside the 
slit for 20 min daily. However, the seed was placed out of reach, so that they could 
never obtain it and, therefore, did not learn skilful reaching movements (as shown 
by testing their performance occasionally). Thus, both trained mice and activity 
control mice experienced similar amounts of forelimb activity, but only trained 
mice developed the motor skill. The activity control was used to determine 
whether enhanced spine dynamics were caused by increased motor activity or 
were specific to motor skill learning. Our results indicate that there is no difference 
in the spine dynamics between the activity controls and general controls. 
Capellini handling task. This task was similar to the vermicelli handling tasks 
previously described for rats31. Mice were food-restricted to maintain 90% of free
feeding weight before training began. A daily training session consisted of 10 trials 
with uncooked capellini pasta pieces (2.5 cm), given one piece per trial. Mice
learned to use coordinated forepaw movements to eat the pasta. The average
consumption time for one piece of capellini pasta decreased from 3.44 ± 0.18 min on
day 1 to 1.98 ± 0.29 min on day 4 (mean ± s.e.m., P < 0.005, 7 mice). There was no
significant behavioural difference in the capellini handling task between naive adults 
and adults pre-trained in the reaching task in adolescence. 
In vivo imaging of superficial dendrites. The procedure for transcranial two- 
photon imaging has been described previously*. Mice aged 1-6 months were 
anaesthetized with an intraperitoneal injection (5.0 ml per kg body weight) of 
17 mg ml⁻¹ ketamine and 1.7 mg ml⁻¹ xylazine in 0.9% NaCl. The skull was
exposed with a midline scalp incision and imaged regions were located based on 
stereotactic coordinates. A small region of skull (~300 µm in diameter) was
manually thinned down to ~20 µm in thickness using both a high-speed drill
and a microknife. To reduce respiration-induced movements, the skull was
glued to a 400-µm-thick stainless steel plate with a central opening for skull
access. The plate was screwed to two lateral bars located on either side of the 
head and fixed to a metal base. The brain of the mouse was then imaged through 
the thinned skull using a Prairie Ultima IV multi-photon microscope with a Ti- 
sapphire laser tuned to the excitation wavelength for YFP (925 nm). Stacks of 
image planes were acquired with a step size of 0.70 µm using a water-immersion
objective (×60, NA 1.1 infrared Olympus objective) at a zoom of 3.0. For
relocation of the same dendrites at subsequent imaging times, an image stack 
containing the dendritic structures of interest was taken without zoom with a 
step size of 2.0 µm and the surrounding blood vessels were imaged with a CCD
camera. The patterns of blood vessels and neuronal processes in this low-reso- 
lution image stack were used for relocating the same dendrites at each sub- 
sequent imaging session (Supplementary Fig. 1). After imaging, the plate was 
detached from the skull, the scalp sutured, and the animal was returned to its 
home cage until the next imaging session. 
Spine and filopodium identification. All analysis of spine dynamics was done 
manually using ImageJ software, blind with regard to experimental conditions.
Briefly, the same dendritic segments (~5–20 µm in length) were identified from
three-dimensional image stacks selected from all views having high image quality 
(signal-to-background-noise ratio >4-fold). Individual dendritic protrusions 
were tracked manually along dendrites. Three-dimensional stacks, instead of 
two-dimensional projections, were used for analysis to ensure that tissue move- 
ments and rotation between imaging intervals did not influence identification of 
dendritic protrusions. The number and location of dendritic protrusions (protru- 
sion length >1/3 dendritic shaft diameter) were identified in each view. Filopodia 
were identified as long thin structures with head diameter/neck diameter <1.2 and 
length/neck diameter >3. The remaining protrusions were classified as spines. 
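
The classification criteria above can be written as a short decision rule. The following is a minimal sketch only: the function name, argument names and the "not counted" label are hypothetical, and in the study the measurements and classification were made manually in ImageJ rather than by code.

```python
def classify_protrusion(length, head_diameter, neck_diameter, shaft_diameter):
    """Classify one dendritic protrusion using the thresholds stated in the text.

    All four measurements must be in the same unit (for example micrometres).
    The function and its labels are illustrative, not the authors' software.
    """
    # Only protrusions longer than 1/3 of the dendritic shaft diameter are counted.
    if length <= shaft_diameter / 3:
        return "not counted"
    # Filopodia: long, thin structures without a pronounced head.
    if head_diameter / neck_diameter < 1.2 and length / neck_diameter > 3:
        return "filopodium"
    # Every remaining protrusion is classified as a spine.
    return "spine"
```

For instance, a protrusion 2.0 µm long with equal head and neck diameters of 0.2 µm on a 1.0-µm shaft would be labelled a filopodium under these rules, whereas the same protrusion with a 0.6-µm head would be labelled a spine.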
Analysis of spine and filopodial dynamics. Notations of the formation and 
elimination of spines and filopodia were based on comparison of the images 
collected at two different time points. Spines or filopodia were considered the same 
between two views if they were within 0.7 µm of their expected positions, based on
their spatial relationship to adjacent landmarks and/or their position relative to 
immediately adjacent spines. A stable spine is a spine that was present in both 
images. An eliminated spine is a spine that appeared in the initial image, but not 
the second image. A newly formed spine is a spine that appeared in the second 
image, but was absent from the initial image. Percentages of stable, eliminated and 
formed spines were all normalized to the initial image. Percentage changes in the 
total spine number over a given interval were relative to the first view and calculated 
as percentage of formation minus percentage of elimination measured over that 
interval. Data on spine dynamics are presented as mean ± s.d.
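
As an illustration of the bookkeeping described above, the sketch below computes the percentages from two sets of spine identifiers. It assumes that spines have already been matched between the two views (the manual step based on the 0.7 µm criterion) and given consistent labels; the function name and the identifier scheme are hypothetical.

```python
def spine_dynamics(initial_ids, second_ids):
    """Return percentages of stable, eliminated and formed spines.

    Every percentage is normalized to the number of spines in the initial
    image, as described in the text. Identifiers are assumed to be already
    matched between views; that matching is not performed here.
    """
    initial = set(initial_ids)
    second = set(second_ids)
    if not initial:
        raise ValueError("the initial image must contain at least one spine")
    n_initial = len(initial)

    def pct(count):
        return 100.0 * count / n_initial

    stable = pct(len(initial & second))      # present in both images
    eliminated = pct(len(initial - second))  # in the initial image only
    formed = pct(len(second - initial))      # in the second image only
    return {
        "stable": stable,
        "eliminated": eliminated,
        "formed": formed,
        # Net change in total spine number: formation minus elimination.
        "net_change": formed - eliminated,
    }


# Hypothetical example: 20 spines initially, 2 lost and 1 gained.
example = spine_dynamics(range(1, 21), list(range(3, 21)) + [99])
```

In this made-up example, 90% of spines are stable, 10% are eliminated and 5% are formed, giving a net change of −5% relative to the first view.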
Image processing and presentation. Two-dimensional projections of three- 
dimensional image stacks containing in-focus dendritic segments of interest 
were used for all figures. We chose very sparsely labelled regions as examples 
and maximum projections were made from images from 2-4 focus planes. There 
were normally few crossing structures in the projected images from such a 
shallow stack, and the presented branches could be clearly isolated. Finally, 
images were thresholded, Gaussian filtered and contrasted for presentation. 
Mapping of motor cortex by intracortical microstimulation. This method was 
adapted from those used in rat experiments*!. Mice were anaesthetized with an 
initial cocktail of ketamine (150 mg kg⁻¹, intraperitoneal) and xylazine
(10 mg kg⁻¹, intraperitoneal) and supplemented with additional ketamine and
isoflurane (0.5–1% in oxygen) as necessary. The mouse was placed into a mouse
stereotaxic frame (Stoelting), lidocaine (2 mg kg⁻¹, subcutaneous) was injected
into the scalp, and a midline incision was made. The cisterna magna was drained 
to prevent cortical swelling and the skull and dura overlying the motor cortex 
were removed. The craniotomy was then filled with warm (37 °C) silicone oil to 
prevent drying. A picture of the cortical surface was taken and overlaid with a 
250 µm square grid in Canvas software.

Intracortical penetrations of a glass microelectrode (diameter of 20–25 µm) with
a platinum wire were made at 250 µm intervals in a systematic order throughout
the cortex at a depth of 790–800 µm (corresponding to deep layer V/shallow layer
VI) with a hydraulic micropositioner until the entire extent of the forelimb
representation was resolved. A 40-ms train of thirteen 200-µs monophasic cathodal
pulses was delivered at 350 Hz from an electrically isolated, constant current
stimulator at a rate of 1 Hz, and the current was increased to a maximum of 60 µA
until a visible movement was evoked. If a movement was evoked at or below 60 µA, the
threshold current was determined by gradually decreasing the stimulation until
the movement stopped. The lowest current that evoked a movement was taken as
the threshold current. If no movement was seen at 60 µA, the site was considered
non-responsive. In cases where stimulation evoked more than one movement, the 
site was considered responsive to the movement that was determined to have the 
lowest threshold. To verify that the stimulation position was located within layer V, 
we injected DiI in seven mice at the end of the experiments and found that all
injections left deposits extending through mid layer V to mid layer VI. In addition, 
penetrating electrode tracts could be observed in Nissl-stained coronal sections in 
most mice. Most (81.3 ± 4.7%) of these tracts terminated in layer V at a measured
depth of 782 ± 137 µm, with the remainder terminating in upper layer VI.
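
The timing of the stimulation train described above can be checked with a few lines of arithmetic. This is a sketch under stated assumptions: the pulses are taken to be evenly spaced at the 350 Hz rate with the first pulse at time zero, neither of which is spelled out in the text, and the constant and function names are hypothetical.

```python
PULSES_PER_TRAIN = 13    # pulses in each train (from the text)
PULSE_WIDTH_S = 200e-6   # 200-microsecond monophasic pulses (from the text)
PULSE_RATE_HZ = 350.0    # pulse rate within a train (from the text)
TRAIN_WINDOW_S = 40e-3   # nominal train duration of 40 ms (from the text)


def pulse_onsets(n_pulses=PULSES_PER_TRAIN, rate_hz=PULSE_RATE_HZ):
    """Onset time in seconds of each pulse, assuming the first pulse at t = 0
    and uniform spacing of 1 / rate_hz (both are assumptions)."""
    return [k / rate_hz for k in range(n_pulses)]


onsets = pulse_onsets()
# End of the final pulse; under the assumptions above it falls inside 40 ms.
last_pulse_end = onsets[-1] + PULSE_WIDTH_S
fits_in_window = last_pulse_end < TRAIN_WINDOW_S
```

With these assumptions the last pulse ends about 34.5 ms after the first begins, which is consistent with the stated 40-ms train.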


31. Allred, R. P. et al. The vermicelli handling test: a simple quantitative measure of 
dexterous forepaw function in rats. J. Neurosci. Methods 170, 229-244 (2008). 
Stably maintained dendritic spines are associated with lifelong memories


Guang Yang, Feng Pan & Wen-Biao Gan


Changes in synaptic connections are considered essential for learn- 
ing and memory formation’ *. However, it is unknown how neural 
circuits undergo continuous synaptic changes during learning while 
maintaining lifelong memories. Here we show, by following post- 
synaptic dendritic spines over time in the mouse cortex”*, that 
learning and novel sensory experience lead to spine formation and 
elimination by a protracted process. The extent of spine remodelling 
correlates with behavioural improvement after learning, suggesting 
a crucial role of synaptic structural plasticity in memory formation. 
Importantly, a small fraction of new spines induced by novel experi- 
ence, together with most spines formed early during development 
and surviving experience-dependent elimination, are preserved and 
provide a structural basis for memory retention throughout the 
entire life of an animal. These studies indicate that learning and 
daily sensory experience leave minute but permanent marks on 
cortical connections and suggest that lifelong memories are stored 
in largely stably connected synaptic networks. 

One remarkable feature of the mammalian brain is its capacity to 
integrate new information throughout life while stably maintaining 
memories. Coincident with these two seemingly mutually exclusive 
attributes of the brain are plasticity and stability of synaptic 
connections'"'. It is well-established that the strength and number 
of synaptic connections can undergo rapid and extensive changes after 
sensory alterations and learning throughout life’****'*""?. On the 
other hand, recent studies have shown that dendritic spines, the post- 
synaptic sites of excitatory synapses, are remarkably stable in adult 
life’ °. Therefore, synaptic connections are not only capable of under- 
going rapid changes in response to new experience but also can serve 
as substrates for long-term information storage. However, it remains 
unknown how and to what degree synapses reorganize during learning 
and how such reorganization is transformed into lifelong memories. 

To address these questions, we used transcranial two-photon micro- 
scopy to examine how fluorescently labelled dendritic spines of layer V 
pyramidal neurons in the mouse cortex are altered and maintained in 
response to skill learning or novel sensory experience’”’**. We first 
examined spine dynamics in the primary motor cortex after motor skill 
learning on an accelerated rotarod”*” (see Methods). In this rotarod 
learning task, animals changed their gait pattern and learned specific 
movement strategies beyond simply running quickly”’. In the forelimb 
area of the motor cortex, rotarod training over 2 days led to a significant
increase (~5–7%) in spine formation in both young (1 month
of age) and adult (>4 months) mice (P < 0.001; Fig. 1a–f and Supplementary
Table). The increased spine formation was not observed in
mice subjected to running similar distances on a slowly rotating 
rotarod and was region specific, occurring in the forelimb motor cortex 
but not in the barrel cortex (Fig. 1f). Notably, after being trained for 
2 days, spine formation over the next 2 days remained significantly 
higher if mice were trained with a different type of motor task (reverse 
Figure 1 | Motor learning and novel sensory experience promote rapid
dendritic spine formation. a, Transcranial two-photon imaging of spines 
before and after rotarod training or sensory enrichment. b, CCD camera 
view of the vasculature of the motor cortex. c, Two-photon image of apical 
dendrites from the boxed region in b. A higher magnification view of a 
dendritic segment in c is shown in d. d, e, Repeated imaging of a dendritic
branch before (d) and after rotarod training (e). Arrowheads indicate new 
spines formed over 2 days. f, The percentage of new spines formed within 
2 days in the motor cortex was significantly higher in young or adult mice 
after training as compared with controls with no training or running on a 
non-accelerated rotarod. No increase in spine formation was found in the 
barrel cortex after training. g, After previous 2-day training, only a new 
training regime (reverse running) caused a significant increase in spine 
formation. h, EE increased spine formation over 2 days in the barrel cortex in 
both young and adult animals. No significant increase in spine formation was 
found under EE when the whiskers were trimmed. i, After previous 2-day EE, 
animals switched to a different EE showed a higher rate of spine formation 
than those returned to SE. Data are presented as mean ± s.d. *P < 0.005. See
Supplementary Table for the number of animals in each group. 
running) than if mice were subjected to the same type of training or no
training (P< 0.005; Fig. 1a, g). Thus, motor learning experience, not 
just physical exercise, induces rapid spine formation within 2 days in 
the primary motor cortex. 

To further understand experience-dependent spine plasticity, we 
examined the impact of novel sensory experience on spine formation 
in the barrel cortex, the primary somatosensory area for whisker sen- 
sation, by switching animals from a standard housing environment 
(SE) to an enriched environment (EE) (see Methods). When either 
young or adult mice were switched from SE to EE, spine formation 
over 1-2 days was significantly (~5%) higher than that under SE 
(Fig. 1a, h; P < 0.001; Supplementary Fig. 1). After being housed in
an EE for 2 days, spine formation over the next 2 days remained sig- 
nificantly higher if mice were housed in a different EE than if mice 
were switched from EE to SE (P < 0.005; Fig. 1i). Notably, sensory
deprivation by whisker trimming prevented the increase in spine 
formation associated with EE over 2 days (P> 0.2; Fig. 1h). Thus, 
novel sensory whisker experience, not simply the exploratory activity 
of the animals under EE, induces new spine formation in the barrel 
cortex. It is worth mentioning that regardless of animals’ ages, neither 
EE nor motor learning increased the number of new dendritic filopodia, 
spine precursors’***, over 2 days (Supplementary Fig. 2). Together, 
these findings indicate that at different stages of animals’ lives, learning 
and novel sensory experience induce rapid and extensive spine forma- 
tion in functionally relevant cortical regions. 

To gain insights into the functional significance of new spines, we 
examined the maintenance of new spines under various conditions 
(with or without skill learning, housed under EE or SE). We found 
that regardless of the animals’ ages or conditions, a small fraction of 
new spines formed over 2 days remained over the next 2 weeks 
whereas most new spines (>75%) were eliminated (Fig. 2a–c and
Supplementary Fig. 3). Interestingly, a significantly larger fraction of 
Figure 2 | A fraction of newly formed spines persists over weeks and
correlates with performance after learning. a, New spines induced by novel 
experience were identified in the first 2 days and followed over time. b, c, The 
survival of new spines (mean ± s.d.) over time under various conditions. A
significantly larger fraction of new spines remained in mice trained 
repeatedly or housed under EE continuously. The lines represent two 
exponential fittings (77 = 1). d, e, An animal’s performance at day 7 strongly 
correlated with new spines formed during the first 2-day training and 
persisting at day 7 (d), but did not correlate with the total new spines 
accumulated from day 0 to 7 (e). Each circle represents an individual animal. 
The linear regression lines and correlation coefficients (r) are shown. See 
Supplementary Table for the number of animals in each group. 
new spines lasted over 2 weeks when the mice were trained for 4-14
consecutive days than when they were trained for only 2 days 
(P < 0.05; Fig. 2b). Similarly, a larger fraction of new spines remained 
if mice continued to stay in the EE than if they were switched from EE 
to SE or stayed under EE but with their whiskers trimmed (Fig. 2c). 
Thus, although new spines are rapidly induced by novel experience 
(Fig. 1f, h), only a small fraction of them are maintained over weeks 
by a protracted process facilitated by persistent experience. 

Many lines of evidence suggest that functional reorganization of 
mammalian cortex associated with motor and sensory training consists 
of a fast phase (within an individual training session) and a slow phase
(between training sessions)**”*. The improvement of performance 
between sessions reaches a plateau over days to weeks and can persist 
for months to years”””***”*, The survival of a fraction of new spines for 
weeks suggests that they may be important for slow-phase learning and 
memory retention. Indeed, we found that the proportion of new spines 
that were formed within the first 2 days and remained at day 7 highly 
correlated with the retention of learned motor skills, as quantified by 
the average running speed that mice mastered on an accelerated 
rotarod (r= 0.93; Fig. 2d and Supplementary Fig. 4). In contrast, the 
extent of new spines accumulated from the beginning of training until 
day 7 did not correlate with motor skill performance (r = −0.13;
Fig. 2e), underscoring the importance of experience-specific spine 
formation rather than increased spine turnover in general. The strong 
correlation between maintained new spines and slow-phase learning 
suggests that new spines are important for the reorganization of 
cortical circuits that underlie new motor skills. Furthermore, because 
a fraction of new spines induced by novel sensory experience are 
maintained, they may be important for receptive field reorganization 
in barrel cortex and contribute to whisker-based decision making””*. 

In addition to promoting synapse formation, experience plays an 
important role in eliminating excessive and imprecise synaptic con- 
nections formed early during development***. To understand 
experience-dependent synaptic remodelling further, we examined 
the elimination of early formed spines in young mice subjected to 
motor training or exposed to EE. We found that in 1-month-old 
mice, neither motor training nor novel sensory experience increased 
the elimination of existing spines or filopodia over 2 days in motor or 
barrel cortex (P > 0.4; Fig. 3a, b and Supplementary Fig. 2). However, 
a significant increase in spine elimination (~4.5%) was observed in 
Figure 3 | Novel experience promotes spine elimination. a, b, Percentage of
spines eliminated (mean ± s.d.) in young animals under various conditions.
Rotarod training (a) or EE (b) for at least 7 days increased the elimination of 
existing spines (P < 0.05). c, EE increased the elimination of spines that 
existed for more than 2 days before EE exposure (P < 0.05). d, The 
elimination of existing spines over 7 days strongly correlated with an 
animal’s performance on day 7 (r= 0.94). Each circle represents an 
individual animal. 
motor cortex when mice were subjected to training for 7-14 days
(P< 0.05; Fig. 3a). Similarly, more spines were eliminated in barrel 
cortex if mice continued to stay in EE for 7-30 days than if they were 
switched from EE to SE or stayed under EE but with their whiskers 
trimmed (P< 0.05; Fig. 3b). Furthermore, we found that the elimi- 
nation of spines that have existed for at least 2 days was increased by 
new experience over 4–5 days (P < 0.05; Fig. 3c). Because the spines
in this pool have likely all made synaptic contacts with axonal 
terminals'>*, these results suggest that new experience leads to the 
pruning of existing synapses and could cause significant functional 
changes in cortical circuits. Indeed, we found that 1 week after motor 
training, motor performance strongly correlated with the degree of 
spine elimination (r= 0.94; Fig. 3d). Thus, motor learning and novel 
sensory experience involve not only new spine formation but also 
permanent removal of connections established early in life (Sup- 
plementary Fig. 5). 

Although the above findings are consistent with the general notion 
that structural synaptic plasticity is critical for learning and memory, 
they raise a fundamental issue about how ongoing experience- 
induced synaptic reorganization can be reconciled with the stability 
needed to support lifelong memories. To address this issue, we first 
examined whether new spines could be maintained over a lifetime. If a
significant number of new spines could last throughout an animal’s 
lifespan, they could directly contribute to permanent memory storage. 
Otherwise, lifelong memory storage cannot rely on these new spines 
and may involve continuous rewiring of synaptic networks. 

To distinguish between these possibilities, we examined the sur- 
vival of new spines over many months in motor and barrel cortices. 
We found that ~4–5% of new spines formed over 2 days persisted for
at least 3 months in motor cortex (2 of 42 spines formed after 2-day 
training) and for at least 5 months in barrel cortex (2 of 50 new 
spines). Thus, a tiny fraction of daily formed new spines (~0.2% 
of the total spines) could persist for 3-5 months. Because it is difficult 
to measure directly and accurately a small fraction of new spines 
surviving over many months, we estimated long-term survival of 
new spines based on the fact that the accumulation of new spines 
depends on the formation rate of new spines and their survival 
fraction (Fig. 4a and Supplementary Information 1). Because the rate 
of spine formation is relatively constant throughout adult life (Sup- 
plementary Fig. 6) and the survival fraction of new spines is compar- 
able under a constant environment (Figs 2b, c and 5), we found that 
our direct measurement of new spine accumulation over time in 
barrel cortex can be best fitted by three exponential components with 
time constants of ~1.5 days, ~1–2 months and ~73–80 months,
respectively (Fig. 4a and Supplementary Information 1). The first 
two exponential components suggest that most daily formed spines 
have an average lifetime of ~1.5 days and a small fraction have an 
average lifetime of ~1–2 months. Importantly, the third component
suggests that ~0.8% of daily formed new spines have an average 
lifetime of ~80 months under SE and ~73 months under EE (Fig. 4a 
and Supplementary Information 1). Because the degree of spine 
formation and the survival of new spines are comparable between 
motor and barrel cortices, a similar degree of daily generated new 
spines in motor cortex are also expected to last over the entire life of 
an animal. 
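
The direct counts quoted at the start of this paragraph can be related to the "~0.2% of the total spines" figure with simple arithmetic. The sketch below is one reading of that figure, not the authors' calculation: it assumes that new spines formed over 2 days amount to about 5% of the total spine population (the value given in the schematic summary of Fig. 5f), and the function and variable names are hypothetical.

```python
def fraction_of_total(persisting, newly_formed, formation_fraction):
    """Fraction of the total spine population accounted for by new spines
    that persisted, given counts of new spines and the fraction of total
    spines that were newly formed."""
    return formation_fraction * persisting / newly_formed


# Assumption: ~5% of total spines are newly formed over 2 days.
FORMATION_FRACTION = 0.05

motor = fraction_of_total(2, 42, FORMATION_FRACTION)   # 2 of 42 new spines, >= 3 months
barrel = fraction_of_total(2, 50, FORMATION_FRACTION)  # 2 of 50 new spines, >= 5 months

motor_percent = 100 * motor    # roughly 0.24% of total spines
barrel_percent = 100 * barrel  # roughly 0.20% of total spines
```

Under this assumption both cortices give values close to 0.2% of total spines, which matches the order of magnitude stated in the text; a formation fraction nearer 7% would give proportionally larger values.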

Based on the survival function of new spines and ~5–7% spine
formation over 2 days under EE or motor learning conditions (Fig. 1), 
we estimated that the number of new spines formed over 2 days and 
persisting at the end of life would be ~0.04% of the total spines in 
motor or barrel cortex (assuming the mouse lifespan is ~36 months; 
Supplementary Information 2). Given the large quantity of spines in 
the mouse cortex, the number of learning-induced and subsequently 
maintained new spines could be ~2 X 10°, sufficiently many to have 
a significant and lifelong impact on neural network functions and an 
animal’s behaviour’”** (Supplementary Information 2). 

Although a fraction of daily generated spines persist and could 
directly contribute to lifelong memory storage, it is important to 
Figure 4 | Maintenance of daily formed new spines and spines formed
during early development throughout life. a, New spine accumulation over 
time under SE and EE. Three exponential fits show that ~0.8% of daily 
formed new spines decay with a time constant of 80 months under SE and 
73 months under EE. b, The percentage of adult spines remaining over time 
under SE and EE. Three exponential fits show that ~90% of adult spines 
have an average lifetime of 90 months under SE and 71 months under EE. c, A
large fraction of spines formed before P30 persisted throughout life under SE
or EE. The projections based on a and b are shown in the dashed frame.
d, Mice previously trained at P30 for 7 days showed better performance
(mean ± s.e.m.) when assessed at 4 months of age than naive mice
(P < 0.01). e, Only a new training regime (reverse running) caused an
increase in spine formation in previously trained animals. Spine data are
presented as mean ± s.d.


note that they represent a minute portion (~0.04%) of the total spine 
population at the end of an animal’s life and likely have their impact 
on the animal’s behaviour in the context of existing circuitry rather 
than acting alone. Because the pruning of existing spines is an 
important aspect of learning (Fig. 3), this raises the question of 
whether early formed spines would persist throughout adult life. If 
a fraction of early formed spines were maintained over a lifetime, they 
may serve as substrates for preserving basic cortical functions and 
early memories. Otherwise, the physical substrates of early memories 
would have to be re-established in synaptic networks that are formed 
later in adulthood. 

To address this question, we measured the survival of existing spines 
over many months in barrel cortex under SE and EE. We found that in 
4-month-old adult mice, ~86% and ~83% of existing spines are 
maintained over a period of 5 months under SE and EE, respectively 
(Fig. 4b). Based on the survival of existing spines over 5–18.5 months,
we estimated that ~90% adult spines have an average lifetime of 
~90 months under SE and ~71 months under EE (Fig. 4b and Sup- 
plementary Information 3). Furthermore, we found that ~78% and 
~73% of existing spines are maintained from postnatal day 30 (P30) to 
2 months of age under SE and EE, respectively (Fig. 3b). Assuming a 
lifespan of 36 months, ~48% (under SE) and ~37% (under EE) of 
spines existing at P30 would remain at the end of life (Fig. 4c). Thus, 
Figure 5 | Spine maintenance in different cell types and cortical layers.
a-c, Age-dependent change in spine number is remarkably similar across 
different cell types/cortical layers in barrel cortex and contains information 
on spine dynamics. Total spine number (percentage of P19) of layer V 
pyramidal cell apical dendrites (a) was measured through in vivo imaging. 
Spine densities (mean + s.e.m.) of layer V and layer VI pyramidal cell basal 
dendrites (b, c) were measured on dendritic segments located 50–100 µm
from the soma in fixed brain slices. d, Spine formation rate declined rapidly 
from P19 to P30 and remained low thereafter. e, Regardless of animals’ ages 
(P19, P30, 6 months), a fraction of new spines formed over 2 days were 
maintained over a similar protracted process. f, Schematic summary of spine 
remodelling and maintenance throughout life. Spines are rapidly formed 


regardless of housing environments, a large fraction of spines that are 
formed before P30 in barrel cortex would persist throughout life. 
Because motor learning and novel sensory experience lead to a similar 
degree of spine remodelling in either young or adult mice (Figs 1-3 and 
Supplementary Table), a large fraction of early formed spines are also 
expected to be stably maintained in the motor cortex. Together, these 
results suggest that spines formed early during development and sur- 
viving experience-dependent elimination could provide a scaffold for 
basic cortical function and lifelong memory storage. 

By examining how spines reorganize and maintain in response to 
novel experiences (Figs 1-4), our studies have revealed the existence 
of two populations of stable spines in synaptic circuits. One popu- 
lation constitutes new spines specifically induced by novel experience 
and maintained later in life. The other population comes from a large 
spine pool formed during early postnatal development, pruned by 
developmental experiences and surviving throughout adulthood. 
Because spines in both populations have an average lifetime between 
70 and 90 months (Fig. 4a, b), ~60–70% of them could persist over
an animal’s life and directly support lifelong memories in synaptic 
circuits. 
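
The ~60–70% figure follows from the quoted lifetimes if survival is treated as a single exponential decay. The sketch below assumes exactly that, together with the 36-month lifespan used elsewhere in the text; the function name is hypothetical, and the authors' full fits (with three exponential components) are described in the Supplementary Information.

```python
import math


def surviving_fraction(elapsed_months, mean_lifetime_months):
    """Fraction of a spine population remaining after elapsed_months,
    assuming single-exponential decay with the given average lifetime."""
    return math.exp(-elapsed_months / mean_lifetime_months)


LIFESPAN_MONTHS = 36  # mouse lifespan assumed in the text

# Average lifetimes at the two ends of the 70-90 month range.
low = surviving_fraction(LIFESPAN_MONTHS, 70)   # about 0.60
high = surviving_fraction(LIFESPAN_MONTHS, 90)  # about 0.67
```

Lifetimes of 70 and 90 months therefore correspond to roughly 60% and 67% of such spines remaining after 36 months, in line with the range stated above.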

One prediction from such a synaptic model of memory storage is 
that information should still be maintained even though ~30-40% 
of synapses in the circuitry are lost. To test this experimentally, we 
trained animals on the rotarod task from P30 to P37 and tested their 
performance at 4 months of age, when ~30% of spines that existed at 
P30 were eliminated in barrel and motor cortices (Fig. 4c and 
Supplementary Table). We found that animals previously trained 
at P30 could still maintain their learned motor skills when tested 
again at 4 months of age (Fig. 4d). Notably, the same training regime 
did not result in a significant increase in spine formation over 2 days 
in these previously trained mice (P> 0.2), whereas a different train- 
ing regime did (P< 0.02) (Fig. 4e). These findings are consistent with 
the above synaptic model of memory storage, suggesting that 
dynamic (~30% spine loss) but largely stable circuits could maintain 
previously acquired skills. 

By studying spine dynamics of layer V pyramidal cell apical dendrites, 
our results suggest that spine maintenance is a fundamental feature of 
neural circuits important for memory storage. However, it remains 
unclear whether the same rule regulating spine dynamics on layer V 
apical dendrites applies to spines in other cell types or cortical layers or 
regions. As shown below, by analysing age-dependent developmental 
after birth, undergo experience-dependent pruning during postnatal
development and remain largely stable in adulthood. Learning or novel 
sensory experience induces rapid formation of new spines (~5% of total 
spines) within 1-2 days. Only a tiny fraction of new spines (~0.04% of total 
spines) survive the first few weeks in synaptic circuits and are stably 
maintained later in life. Novel experience also results in the pruning of a 
small fraction of existing spines formed early during development. New 
stable spines induced by novel experience, together with existing spines 
formed during early development and surviving experience-dependent 
pruning, provide an integrated and stable structural basis for lifelong 
memory storage, despite ongoing plasticity in synaptic networks. 


profiles of spine number, we found evidence that stably maintaining a 
fraction of new spines and spines formed early in life is likely to be a 
general rule for lifelong information storage in the cortex. 

Many lines of evidence indicate that developmental change in 
synapse number is remarkably similar across different cortical layers 
and regions in a variety of species”***°. We found that in the dendrites
of layer V and VI pyramidal neurons in mouse barrel cortex, the 
number of spines rose rapidly after birth, underwent a substantial 
net loss during late postnatal life and declined slowly throughout 
adulthood (Fig. 5a—c). Importantly, in the apical dendrites of layer V 
pyramidal cells, we found that the substantial net loss of spines during 
postnatal development was due to a combination of two factors: (1) a 
tremendous burst in spine formation early in life was followed by a 
rapid decline in spine formation from P19 to P30 (Fig. 5d); and (2) 
regardless of developmental stages, only a small fraction of newly 
formed spines were maintained by a similar prolonged process 
(Fig. 5e and Supplementary Fig. 7). Specifically, a substantial net loss 
of spines occurred during the late postnatal period because early 
formed spines (before P19) continued to be eliminated from P19 to 
P60 at a rate higher than that of new spine addition. The remarkably 
similar patterns of developmental spine loss in different cortical layers 
and species suggest that both a rapid decline in spine formation and 
maintenance of a fraction of new spines by a prolonged process are 
general rules in the development of the mammalian cortex (Sup- 
plementary Information 4). 

If stably maintaining a fraction of new spines by a prolonged 
process is a common rule, do most adult spines in other cells and 
layers persist as those of layer V pyramidal cell apical dendrites? 
Assuming that spine formation is constant throughout adulthood 
and all spines that survive a prolonged process have the same average 
lifetime, τ, the total number of adult spines would change according 
to the equation $A + Be^{-t/\tau}$ (Supplementary Information 5). Based on 
the gradual decline in spine number of apical dendrites of layer V 
pyramidal cells (Fig. 5a), we estimated that the average lifetime of 
adult spines is ~71 months. This number is highly comparable to the 
average lifetime of new stable spines (Fig. 4a; 73-80 months) or exist- 
ing spines (Fig. 4b; 71-90 months) that we measured with the in vivo 
imaging approach. Furthermore, based on the age-dependent decline 
in spine density of basal dendrites of layer V and VI pyramidal cells 
(Fig. 5b, c), we estimated that the average lifetime of adult spines 
on these dendrites is ~70–125 months. Together, these projections 
suggest that (1) developmental profiles of spine number contain 
important information on spine dynamics, and (2) most adult spines 
in other cell types and cortical layers could be stably maintained and 
serve as substrates for long-term information storage. 
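
The lifetime estimate described above can be illustrated with a minimal sketch, assuming spine number follows the single-exponential form A + B·exp(−t/τ) given in the text. The function names, the grid of candidate lifetimes and the synthetic values of A and B are hypothetical and chosen only for illustration; the fitting procedure actually used is described in Supplementary Information 5 and may differ.

```python
import math


def spine_number(t, a, b, tau):
    """Model from the text: A + B * exp(-t / tau), with t and tau in months."""
    return a + b * math.exp(-t / tau)


def fit_lifetime(ages, counts, tau_grid):
    """Estimate (A, B, tau) by a grid search over tau (an assumed approach).

    For each candidate tau the model is linear in A and B, so those two
    parameters are obtained by ordinary least squares; the tau with the
    smallest sum of squared errors is returned.
    """
    n = len(ages)
    best = None
    for tau in tau_grid:
        x = [math.exp(-t / tau) for t in ages]
        mean_x = sum(x) / n
        mean_y = sum(counts) / n
        sxx = sum((xi - mean_x) ** 2 for xi in x)
        sxy = sum((xi - mean_x) * (yi - mean_y) for xi, yi in zip(x, counts))
        b = sxy / sxx
        a = mean_y - b * mean_x
        sse = sum((a + b * xi - yi) ** 2 for xi, yi in zip(x, counts))
        if best is None or sse < best[0]:
            best = (sse, a, b, tau)
    return best[1], best[2], best[3]


# Synthetic, noise-free example (hypothetical values, not data from the paper).
ages_months = [2, 4, 8, 12, 18, 24, 30]
synthetic_counts = [spine_number(t, 10.0, 90.0, 71.0) for t in ages_months]
a_hat, b_hat, tau_hat = fit_lifetime(ages_months, synthetic_counts, range(20, 201))
```

With real, noisy density measurements the three parameters are only weakly constrained by a slow decline, so any such estimate should be read together with its uncertainty.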

Determining how long-lasting memories are stored in neuronal 
circuits remains a great challenge. Because synapses undergo rapid 
changes in response to environmental perturbations, it is unknown 
how dynamic synaptic circuits maintain indelible memories. Here we 
show that, despite ongoing circuit plasticity, two populations of 
stable spines are important for maintaining lifelong memories. 
Specifically, our findings suggest that a minute fraction of new spines 
(~0.04% of total spines) induced by novel experience, together with 
spines formed early during development and remaining after experi- 
ence-dependent pruning, represent a unique and stable physical 
entity for lifelong memory storage (Fig. 5f and Supplementary 
Discussion). The fact that most spines in such an entity persist under- 
scores the fundamental importance of stably connected synaptic cir- 
cuits in lifelong memory storage. 


METHODS SUMMARY 


Mice expressing YFP (H-line) were used in all the experiments. Sensory enrich- 
ment was conducted by placing mice in standard mouse cages containing strings 
of beads whose positions were changed daily. Motor training was performed by 
placing mice on an accelerated motorized rod. The rotation speed was recorded 
when the animal could not keep up with the rotating rod and fell. Performance 
was measured as the average speed animals achieved during the 20-trial 
training session per day. The procedure of in vivo transcranial two-photon 
imaging, spine density measurement and data quantification was described 
previously”*. P values were calculated using Student’s t-test. 


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 

METHODS 

Experimental animals. Mice expressing YFP in layer V pyramidal neurons 
(H-line) were purchased from the Jackson Laboratory and group-housed in 
the Skirball animal facilities. All experiments were done in accordance with 
institutional guidelines. 

Sensory enrichment. Sensory enrichment was conducted in standard mouse 
cages containing strings of beads hanging from the top of the cages 
(Supplementary Movies 1 and 2). The positions of bead strings were changed 
daily. Mice could move freely in these cages and had to navigate through the 
strings of beads to obtain food and water. 

Rotarod training procedure. An EZRod system with a test chamber 
(44.5 cm × 14 cm × 51 cm, Accuscan Instruments) was used in this study. 
Animals were placed on the motorized rod (30 mm in diameter) in the chamber. 
The rotation speed gradually increased from 0 to 100 r.p.m. over the course of 
3 min. The time latency and rotation speed were recorded when the animal was 
unable to keep up with the increasing speed and fell. Rotarod training/testing was 
performed in one 30-min session per day (20 trials in total). Performance was 
measured as the average speed animals achieved during the 20 trials. For control 
experiments, animals were either placed on the still rod for 2 min and then 
dropped to the bottom of the chamber, repeated for 20 trials (no-training 
control), or forced to run on the rod rotating at a constant speed of 15 r.p.m. 
(non-accelerated rotarod control, 60 min in total with a 20-s break every 5 min). A 
reverse running regime was introduced to provide pre-trained mice with a new 
motor learning experience. In this regime, animals were forced to run backwards 
on the rotating rod (speed increased gradually from 0 to 50 r.p.m. over 3 min) for 
20 trials. 
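
The performance measure described above can be sketched as follows. The sketch assumes that the rod speed rises linearly with time, which the text does not state explicitly (it says only that speed "gradually increased"); the function and parameter names are hypothetical.

```python
def speed_at_fall(latency_s, start_rpm=0.0, end_rpm=100.0, ramp_s=180.0):
    """Rod speed (r.p.m.) at the moment the animal fell.

    Assumes a linear ramp from start_rpm to end_rpm over ramp_s seconds;
    the speed is held at end_rpm if the animal stays on past the ramp.
    """
    if latency_s < 0:
        raise ValueError("latency must be non-negative")
    fraction = min(latency_s / ramp_s, 1.0)
    return start_rpm + fraction * (end_rpm - start_rpm)


def session_performance(latencies_s, **ramp):
    """Average speed at fall across all trials of one training session."""
    speeds = [speed_at_fall(t, **ramp) for t in latencies_s]
    return sum(speeds) / len(speeds)
```

For the reverse running regime, the same functions would apply with `end_rpm=50.0`, matching the 0 to 50 r.p.m. ramp over 3 min.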

Identification of the forelimb region of the motor cortex and the barrel 
cortex. The location of imaging in the motor cortex is 1.3 mm anterior to the 
bregma and 1.2 mm lateral from the midline. In a previously published study*’, 
this region has been identified through microstimulation as the location of 
forelimb representations in the same mouse strain as we used in our study. 
We confirmed this region of forelimb representations by microstimulation in 
our own hands. In addition, the location of imaging in the barrel cortex is 
1.1 mm posterior to the bregma and 3.4 mm lateral from the midline. We have 
previously confirmed this location is within the barrel cortex using cytochrome 
oxidase staining’. Because our imaging window was rather small 
(200 µm × 200 µm), we chose to use stereotaxic coordinates of previously 
mapped forelimb and barrel regions as the guide to study spine dynamics in 
motor and barrel cortices. 

In vivo transcranial two-photon imaging. The degree of spine formation and 
elimination was obtained from longitudinal studies by imaging the mouse cortex 
through a thinned-skull window. Because thinning the skull to ~20 µm at each 
imaging session without damaging the cortex becomes difficult after several 
chronic imaging sessions, we designed our experiments such that the same 
animals were imaged no more than four times. For the measurement of new 
spine survival in Fig. 2b, c, most but not all of the data came from chronic 
imaging of the same mice. For the measurement of new spine accumulation 
and existing spine survival in Fig. 4a, b, a total of 57 animals were used (most 
of them were imaged twice, eight of them were imaged three or four times). 
The surgery and imaging procedures are described below. 

1. Anaesthetize the mouse with an intraperitoneal injection of ketamine/xylazine 
mix (20 mg ml⁻¹ ketamine, 3 mg ml⁻¹ xylazine in saline, 5–6 µl g⁻¹ body weight). 
2. Carefully shave the hair of the scalp with a double-edged razor blade. Make a 
midline incision of the scalp with sterile surgical scissors. The incision should 
extend from the middle of the ears to the frontal area. 

3. Remove the periosteum tissue with a microsurgical blade. The brain area to be 
imaged was localized based on the stereotactic coordinates and marked with a 
fine marker. 

4. Place a small amount of glue around the edges of the internal opening of the 
skull holding plate and press it against the skull for a few seconds. Make sure that 
the area to be imaged is exposed in the centre of the internal opening of the skull 
holder. 

5. Wait approximately 5 min until the plate is stably glued to the skull and then 
place the mouse on a cotton pad on top of an optional heating pad. Attach the 
skull holder to two metal cubes adhered to a large plate for immobilizing the 
holder. Wash away unpolymerized glue with artificial cerebrospinal fluid 
(ACSF). 

6. Use a high-speed micro-drill to thin a circular area of skull (typically ~0.5–1 mm 
in diameter) over the region of interest under a dissection microscope. Drilling 
should be done intermittently to avoid overheating. Replace ACSF periodically and 
wash away the bone debris. 

7. The mouse skull consists of two thin layers of compact bone, sandwiching a 
thick layer of spongy bone. The spongy bone contains tiny cavities arranged in 
concentric circles and multiple canaliculi that carry blood vessels. Remove the 
external layer of the compact bone and most of the spongy bone with the drill. 
Some bleeding from the blood vessels running through the spongy bone may 
occur during the thinning process. This bleeding will usually stop spontaneously 
within a few minutes. 

8. After removing most of the spongy bone, use a microsurgical blade to continue 
the thinning process until a very thin (~20 µm) and smooth preparation 
(~200 µm in diameter) is achieved. 

9. Use a conventional epifluorescence microscope to check if dendrites and 
spines in the area of interest can be clearly visualized at this stage. The thickness 
of the skull can also be directly determined by visualization of the skull with a 
two-photon microscope. 

10. A CCD (charge-coupled device) camera can be used to acquire a high-quality 
picture of the brain vasculature, which is used as a landmark for future relocation. 
11. Carefully move the mouse to the two-photon microscope and select an area 
for two-photon imaging. The selected area is then carefully identified and 
marked in the CCD vasculature map. 

12. Tune the two-photon microscope to the appropriate wavelength (920 nm for 
yellow fluorescent protein). Imaging is achieved by using ×60 water-immersion 
objectives with numerical aperture 1.1. 

13. Obtain a low-magnification stack of fluorescently labelled neuronal processes 
at ×1 zoom, which serves as a more precise map for relocation of the same area at 
later time points in addition to the CCD image of brain vasculature. The stack is 
typically taken within ~200 µm below the pial surface. Additional higher 
magnification (×3 digital zoom) images can be taken by electronically moving the 
imaged area. 

14. For re-imaging the same region, find the thinned region based on the brain 
vasculature map. Carefully remove the connective tissue that has re-grown on 
top of the thinned region using a microsurgical blade, and check the image 
quality with the two-photon microscope. The skull may need to be re-thinned. 
15. Use a microsurgical blade to shave the skull carefully until a clear image can 
be obtained. 

16. Find the imaged region under a fluorescence microscope. Align the region 
according to a ×1 zoom map under the two-photon microscope, then zoom in 
to ×3 to align it further. 

17. After the image is precisely aligned with the first view, take images as previ- 
ously described. 
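
The arithmetic behind the anaesthetic volume in step 1 can be written out as a short sketch. It uses only the concentrations and the 5–6 µl per g range stated above; the function name and the returned keys are hypothetical, and the sketch is an illustration of the calculation, not a dosing guide.

```python
def injection_plan(body_weight_g, ul_per_g=5.0,
                   ketamine_mg_per_ml=20.0, xylazine_mg_per_ml=3.0):
    """Injected volume and drug amounts for the ketamine/xylazine mix.

    ul_per_g should lie in the 5-6 range given in step 1; the default of
    5.0 is simply the lower bound.
    """
    volume_ul = body_weight_g * ul_per_g
    volume_ml = volume_ul / 1000.0
    return {
        "volume_ul": volume_ul,
        "ketamine_mg": volume_ml * ketamine_mg_per_ml,
        "xylazine_mg": volume_ml * xylazine_mg_per_ml,
    }


# Example for a hypothetical 25 g mouse at the lower bound of the range.
plan = injection_plan(25.0)
```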

Data analysis. ImageJ software was used to analyse image stacks. The same 
dendritic segments were identified from three-dimensional stacks taken from 
different time points with high image quality (ratio of signal to background noise 
>4:1). The number and location of dendritic protrusions (protrusion length was 
more than one-third the dendritic shaft diameter) were identified in each view 
without previous knowledge of the animal’s age, the interval between views or 
the order of the views. The total number of spines (n) was pooled from dendritic 
segments of different animals. Filopodia were identified as long, thin structures 
(generally larger than twice the average spine length, ratio of head diameter to 
neck diameter <1.2:1 and ratio of length to neck diameter >3:1). The remaining 
protrusions were classified as spines. No subtypes of spines were separated. 
Three-dimensional stacks were used to ensure that tissue movements and rota- 
tion between imaging intervals did not influence spine identification. Spines or 
filopodia were considered the same between views if their positions remained the 
same distance from relative adjacent landmarks. Spines were considered 
different if they were more than 0.7 µm away from their expected positions based on 
the first view. 
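
The morphological criteria above can be summarized in a minimal sketch. The thresholds are those stated in the text; combining the three filopodium criteria with a logical AND is an assumption (the text says "generally"), and the class and function names are hypothetical. In the study itself, scoring was done on image stacks in ImageJ rather than with code of this kind.

```python
from dataclasses import dataclass


@dataclass
class Protrusion:
    length_um: float
    head_diameter_um: float
    neck_diameter_um: float


def is_protrusion(length_um, shaft_diameter_um):
    """Counted if longer than one-third of the dendritic shaft diameter."""
    return length_um > shaft_diameter_um / 3.0


def classify(p, mean_spine_length_um):
    """Return 'filopodium' or 'spine' using the ratios given in the text."""
    long_enough = p.length_um > 2.0 * mean_spine_length_um
    small_head = p.head_diameter_um / p.neck_diameter_um < 1.2
    elongated = p.length_um / p.neck_diameter_um > 3.0
    if long_enough and small_head and elongated:
        return "filopodium"
    return "spine"


def same_spine(offset_um, tolerance_um=0.7):
    """A spine matches across views if within 0.7 um of its expected position."""
    return offset_um <= tolerance_um
```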

Changes in cortical volume associated with motor skill learning and EE have 
been well documented previously*”*’. It is important to note that because we 
measured spine dynamics on the same dendrites in the same animals over time, 
our measurements of spine elimination and formation were not sensitive to 
changes in cortical volume. 

Division and apoptosis of E2f-deficient retinal progenitors 

The activating E2f transcription factors (E2f1, E2f2 and E2f3) 
induce transcription and are widely viewed as essential positive 
cell cycle regulators. Indeed, they drive cells out of quiescence, and 
the ‘cancer cell cycle’ in Rb null cells is E2f-dependent'”. Absence 
of activating E2fs in flies or mammalian fibroblasts causes cell 
cycle arrest**, but this block is alleviated by removing repressive 
E2f or the tumour suppressor p53, respectively” ’. Thus, whether 
activating E2fs are indispensable for normal division is an area of 
debate’. Activating E2fs are also well known pro-apoptotic factors, 
providing a defence against oncogenesis*, yet E2f1 can limit 
irradiation-induced apoptosis”'®. In flies this occurs through 
repression of hid (also called Wrinkled; Smac/Diablo in mam- 
mals). However, in mammals the mechanism is unclear because 
Smac/Diablo is induced, not repressed, by E2f1"', and in keratino- 
cytes survival is promoted indirectly through induction of DNA 
repair targets'*. Thus, a direct pro-survival function for E2f1-3 
and/or its relevance beyond irradiation has not been established. 
To address E2f1-3 function in normal cells in vivo we focused on 
the mouse retina, which is a relatively simple central nervous 
system component that can be manipulated genetically without 
compromising viability and has provided considerable insight 
into development and cancer”’’. Here we show that unlike fibro- 
blasts, E2f1–3 null retinal progenitor cells or activated Müller glia 
can divide. We attribute this effect to functional interchange- 
ability with Mycn. However, loss of activating E2fs caused down- 
regulation of the p53 deacetylase Sirt1, p53 hyperacetylation and 
elevated apoptosis, establishing a novel E2f–Sirt1–p53 survival 
axis in vivo. Thus, activating E2fs are not universally required 
for normal mammalian cell division, but have an unexpected 
pro-survival role in development. 

During retinal development, a thin neuroblastic layer of progeni- 
tors undergoes extensive expansion from mouse embryonic day 11 
(E11) to approximately postnatal day 8 (P8), generating post-mitotic 
differentiating cells that develop into the six major retinal neurons 
and Müller glia. To study E2f1–3 function during retinal progenitor 
expansion, floxed E2f3 mice were crossed with E2f1-/-, E2f2-/- and 
α-Cre (which deletes E2f3 in peripheral retinal progenitors at E10“) 
mice. We then assessed Ki67 to mark all dividing cells, or phospho- 
histone H3 (PH3) and 5-bromodeoxyuridine (BrdU) incorporation 
to mark mitosis and S phase, respectively (Fig. 1 and Supplementary 
Fig. 1a). Some reduction was detected in the E2f1-/- retina, but the 
E2f2-/- E2f3-/- double knockout retina was unaffected, and E2f2 or 
E2f3 loss did not affect E2f1-/- progenitors (Fig. 1a, b and data not 
shown). At least one activating E2f is essential for fibroblast division‘, 
so we expected a drastic phenotype in the E2f1-/- E2f2-/- E2f3-/- 
triple knockout retina where E2f3 is deleted before progenitor expan- 
sion. Broad Cre-IRES-GFP transgene expression, as well as analysis of 


DNA, messenger RNA and protein at various time points, all demon- 
strated robust E2f3 excision (Supplementary Fig. 1), yet remarkably 
we observed many triple knockout Ki67+ and PH3+ cells at both E14 
and P0 (Fig. 1a, b). The effect was slightly more pronounced at P0 
than E14, and is discussed later. At E14 and E17 BrdU labelling was 
weaker in triple knockout versus wild-type progenitors but many 
BrdU+ cells were obvious at higher magnification, suggesting con- 
tinued, albeit slower, division (Supplementary Fig. 1a). At E14 and P0 
cell cycle distribution was the same in wild-type, E2f1-/- or triple 
knockout retinas (Supplementary Fig. 2), suggesting lengthening of 
all phases. Surprisingly, therefore, although activating E2fs contri- 
bute to progenitor expansion, robust division continues in their 
absence. E2f1 is essential to drive abnormal division of Rb1-deficient 
differentiating retinal neurons", highlighting the distinct role for 
activating E2fs in normal versus cancer cell cycles in the same tissue. 

Mature neurons never divide, but retinal damage triggers reactive 
gliosis and Müller glia mitosis. We damaged retinas either genetically 
using homozygous rd1 (also called Pde6b’’ ), causing photorecep- 
tor degeneration, or by means of intravitreal toxin injection. GFAP 
induction and p27Kip1 and cyclin D3 downregulation, hallmarks of 
reactive gliosis'®, were observed independent of E2f1–3 
(Supplementary Fig. 3a, b and data not shown). Furthermore, the 
proportion of dividing BrdU+/CRALBP+ Müller nuclei in the inner 
nuclear layer was identical in control or triple knockout toxin-treated 
retina (Supplementary Fig. 3c, d). Thus, both progenitors and 
mature glia proliferate without activating E2fs. 

Some E2f targets (for example, cyclins/Mcm3/Tk1) were down- 
regulated in E2f1-/- E2f2-/- E2f3-/- triple knockout progenitors 
(Fig. 2a), and we wondered if reduction of repressive E2fs (E2f4–8) 
might explain continued division. However, whereas E2f7 and E2f8 
mRNA levels were slightly down in the triple knockout retina, E2f4–6 
levels were unaffected at E14, and E2f5 and E2f6 were induced at P0 
(Fig. 2a). Triple knockout fibroblasts arrest as a result of p53 induc- 
tion of Cdkn1a, which encodes the cyclin-dependent kinase (Cdk) 
inhibitor p21Cip1 (refs 4, 5). Notably, expression of Cdkn1a and the 
related Cdkn1c (p57Kip2) were reduced in triple knockout retina 
(Fig. 2b). Myc (previously c-Myc) represses the Cdkn1a promoter 
in vitro'® and its overexpression bypasses arrest caused by inhibiting 
E2f'”'*. Because E2f inhibition in these studies targeted all E2fs, it is 
unclear if Myc overrides loss of activating E2fs alone. Whether 
physiological Myc levels can compensate for E2f1–3 in vivo is also 
unknown. Notably, Mycn and Mycl1 levels were higher than Myc in 
E14 retina, but the reverse applied in fibroblasts (Fig. 2c). E2f1–3 loss 
reduced Myc levels in fibroblasts, but had no effect on Myc family 
expression in retinal progenitors (Fig. 2c). To test Mycn function, the 
floxed (f) locus was introduced into the triple knockout background 
and recombination confirmed (Supplementary Fig. 4). Deleting 
Mycn alone, or even with E2f2/3, did not affect proliferation or sur- 
vival (Figs 1a, b and 3a–c and data not shown; see also Supplementary 
Discussion on eye size). In contrast, progenitor division was markedly 
reduced in the E2f1-/- E2f2-/- E2f3-/- Mycn-/- quadruple knockout 
peripheral retina with 12-fold or 67-fold fewer Ki67+ cells than wild- 
type retina at E14 and P0, respectively, and the neuroblastic layer 
was very thin (Fig. 1a). PH3 and BrdU analyses confirmed this 
dramatic effect (Fig. 1a, b, and data not shown). To address cell 
autonomy in Mycn/E2f1–3 redundancy, Cre retrovirus co-expressing 
green fluorescent protein (GFP) (Supplementary Fig. 5) was injected 
subretinally into newborn mice and clone size analysed at P21. The 
average size of wild-type, E2f2-/-, E2f2-/- E2f3-/- and Mycn-/- 
clones was similar (~2.1 cells per clone); E2f1-/- E2f2-/- and 
E2f1-/- E2f2-/- E2f3-/- clones were smaller (~1.7 cells per clone); 
but E2f1-/- E2f2-/- E2f3-/- Mycn-/- clones were almost always 
single cells (1.07 cells per clone) (Supplementary Fig. 6). Analysis 
of E2f target gene expression revealed that a subset of E2f targets 
that were reduced by E2f1–3 loss at E14 was further reduced in the 
E2f1-/- E2f2-/- E2f3-/- Mycn-/- retina, and a larger subset 
unaffected by E2f1–3 loss was downregulated in the E14 
E2f1-/- E2f2-/- E2f3-/- Mycn-/- retina (Supplementary Fig. 7 and 
Supplementary Discussion). Notably, whereas Cdkn1a and Cdkn1c 
were reduced in the E2f1-/- E2f2-/- E2f3-/- retina, they were 
upregulated in E2f1-/- E2f2-/- E2f3-/- Mycn-/- retina (Fig. 2b, 
Supplementary Fig. 7 and Supplementary Discussion). Thus, Mycn 
promotes division of E2f1-/- E2f2-/- E2f3-/- progenitors cell 
autonomously and maintains E2f target expression while repressing Cdk 
inhibitors. 

Figure 1 | Mycn allows division without activating E2fs. a, Retinal sections 
of indicated genotypes and ages were stained for nuclei (DAPI (4,6- 
diamidino-2-phenylindole), blue), mitoses (top two panels, PH3, red) or all 
dividing cells (bottom two panels, Ki67, green). Owing to the small size of the 
E14 E2f1-/- E2f2-/- E2f3-/- Mycn-/- retina, the lens is also visible in these 
panels. Scale bar, 50 µm. WT, wild type. b, Quantification of PH3+ and 
Ki67+ cells. GCL, ganglion cell layer; NBL, neuroblastic layer. Data in b are 
mean ± s.d. and asterisks indicate significant difference from wild type 
(n = 3). Asterisk, P < 0.05; double asterisk, P < 0.01; Student’s t-test. 

Figure 2 | Mycn blocks Cdk inhibitor induction in E2f1–3 null progenitors. 
a, Quantitative polymerase chain reaction with reverse transcription 
(qRT-PCR) for the indicated genes in E14 and P0 wild-type and 
E2f1-/- E2f2-/- E2f3-/- retinas. Triple knockout expression levels are 
shown relative to wild type. b, qRT-PCR for indicated genes in E14 wild- 
type, E2f1-/- E2f2-/- E2f3-/- and E2f1-/- E2f2-/- E2f3-/- Mycn-/- 
retinas. Levels are plotted relative to wild type. c, qRT-PCR for Myc relatives 
in indicated cells. E2f1-/- E2f2-/- E2f3-/- cells were GFP-sorted from 
α-Cre E2f1-/- E2f2-/- E2f3f/f retinas or E2f1-/- E2f2-/- E2f3f/f fibroblasts 
transduced with MXIE-Cre virus. All data are mean ± s.d. (n = 3). 

Because E2f1–3 are non-essential for division, we considered other 
roles. Activating E2fs are pro-apoptotic’’, but at E14 and E17 E2f1–3 
loss did not affect low background apoptosis, yet remarkably, apop- 
tosis was elevated in postnatal E2f1-/- E2f2-/- E2f3-/- retina, and 
could be suppressed by any single activating E2f (Fig. 3a–c and 
Supplementary Fig. 8). By P8, when progenitor division has ended, 
apoptosis was normal (Supplementary Fig. 8a–c). Labelling with 
TdT-mediated dUTP nick end labelling (TUNEL) and BrdU showed 
that apoptotic E2f1-/- E2f2-/- E2f3-/- cells were progenitors (Sup- 
plementary Fig. 9). This phenotype may explain the greater reduction 
in progenitors at P0 versus E14 (Fig. 1a, b). Thus, activating E2fs have 
an unexpected pro-survival role during retinal development. 

In flies, E2f1 protects some irradiated cells by repressing pro- 
apoptotic hid’, but in the E2f1-/- E2f2-/- E2f3-/- retina the mam- 
malian equivalents Smac/Diablo and Omi/Htra2 were downregulated 
(Fig. 3d). The E2f target p73 was also downregulated, but immuno- 
reactivity for its relative p53 was elevated, and its pro-apoptotic target 
Noxa (also called Pmaip1) was induced (Fig. 3d–g). We introduced 
the floxed p53 locus (Trp53f/f) into the α-Cre E2f1-/- E2f2-/- E2f3f/f 
background, confirmed recombination (Supplementary Fig. 10) and found 
that whereas there was no difference in Ki67+, PH3+ or BrdU+ cells 
in E2f1-/- E2f2-/- E2f3-/- versus E2f1-/- E2f2-/- E2f3-/- Trp53-/- 
progenitors at E14 or P0 (Fig. 1a, b, and data not shown), apoptosis 
was suppressed in E2f1-/- E2f2-/- E2f3-/- Trp53-/- P0 progenitors 
(Fig. 3a–c). Although E2f1-/- E2f2-/- E2f3-/- Mycn-/- P0 retina had 
fewer total apoptotic cells (Fig. 3a–c), this was due to fewer retinal 
progenitors; thus, the proportion of apoptotic (TUNEL+) dividing 
(Ki67+) progenitors at P0 was not reduced by Mycn loss, but was 
slightly increased (Supplementary Fig. 11), consistent with the slightly 
elevated Noxa expression relative to the E2f1-/- E2f2-/- E2f3-/- retina 
(Supplementary Fig. 7). Thus, unlike in fibroblasts*, p53 does not reduce 
division in E2f1-/- E2f2-/- E2f3-/- progenitors, but drives apoptosis, 
providing and explaining the first pro-survival role for activating E2fs 
in normal cells. 

How do E2fs constrain p53 pro-apoptotic activity? Aspp proteins 
bind p53 and promote its pro-apoptotic function, but they are 
E2f-induced”*”! and were downregulated in E2f1-/- E2f2-/- E2f3-/- 
progenitors; the Aspp inhibitor, Iaspp (also called Ppp1r13l), was un- 
affected (Fig. 3d). p19Arf stabilizes p53 by inhibiting Mdm2, and E2f3 
loss de-represses p19Arf transcription in fibroblasts”, but we observed 
less p19Arf mRNA and protein in triple knockout progenitors (Fig. 3d, 
g). E2f1–3 loss did not affect Trp53 mRNA, and total p53 protein levels 
were increased only ~1.5-fold in E2f1-/- E2f2-/- E2f3-/- retina 
(Fig. 3d, g), thus the 13-fold increase in immunoreactivity may reflect 
epitope exposure (Fig. 3e, f), perhaps through posttranslational modi- 
fication®’. E2f1–3 loss did not induce p53 phosphorylation (data not 
shown) but increased acetylation considerably (Fig. 3g). Acetylation is 
essential for p53 activity”, is mediated by Tip60, Pcaf and Cbp/p300 
and is reversed by Hdacs and Sirt1**. Notably, Sirt1, Sirt2, Sirt3, Hdac1 
and Hdac3 mRNA was reduced in triple knockout retina, as were 
protein levels of Sirt1 and Hdac1 (Fig. 3d, g). Hdac1 and Hdac3 down- 
regulation was greater at P0 than at E14, correlating with p53 immuno- 
detection at P0 (Fig. 3d–f). Sirt1 is a direct E2f1 target”, but whether 
other activating E2fs sustain its expression is unknown, and a func- 
tional E2f–Sirt1–p53 axis has not been established. Electroporating Cre 
plasmid into E2f1-/- E2f2-/- E2f3f/f P0 retinas induced apoptosis, but 
strikingly concomitant Sirt1 expression blocked cell death (Fig. 4a, b). 
Moreover, treatment of pregnant dams from E16 with the Sirt1 agonist 
resveratrol blocked >70% of E2f1-/- E2f2-/- E2f3-/- progenitor 
apoptosis at P0, and this correlated with p53 deacetylation and block- 
ade of p53-dependent Noxa induction, without affecting p19Arf 
(Supplementary Fig. 12). Just as p53 loss did not influence division 
(Fig. 1a, b), resveratrol did not alter cyclin levels or proliferation 
(Supplementary Figs 12d and 13). In summary, activating E2fs inter- 
changeably induce Sirt1 to block p53 acetylation and apoptosis. High 
levels of E2f have long been known to activate p53 (ref. 8) and our 
findings now connect low E2f to p53 activation, but through an entirely 
distinct mechanism (Fig. 4c). 

Figure 3 | A pro-survival role for activating E2fs. a, Retinal sections of 
indicated genotypes and ages were stained for nuclei (DAPI, blue) and 
apoptosis (TUNEL, red). b, c, Quantification of TUNEL+ cells: 
b, proportion; c, number per 100 µm. d, qRT-PCR analysis of indicated 
genes in E14 and P0 wild-type and E2f1-/- E2f2-/- E2f3-/- retinas. 
Expression levels in the triple knockout are shown relative to those in wild- 
type cells. e, Retinal sections of the indicated genotypes and ages were 
stained for nuclei (DAPI, blue) and p53 (red). f, Quantification of p53+ cells. 
g, Cell lysates of P0 wild-type and E2f1-/- E2f2-/- E2f3-/- retinas were 
probed with the indicated antibodies on western blots. 
E2f1-/- E2f2-/- E2f3-/- cells were dissected from 
α-Cre E2f1-/- E2f2-/- E2f3f/f peripheral retina. Scale bars in a, e are 50 µm. 
GCL, ganglion cell layer; NBL, neuroblastic layer. Data in b–d, f are 
mean ± s.d. Asterisks in b, c indicate significant difference from wild type 
(n = 3). Asterisk, P < 0.05; double asterisk, P < 0.01; Student’s t-test. 

Figure 4 | E2fs promote survival through Sirt1-mediated p53 
deacetylation. a, MXIE plasmid, MXIE-Cre plasmid, or MXIE-Cre plasmid 
plus YESir2 plasmid expressing Sirt1 (ref. 29) were injected subretinally and 
electroporated into the P0 retinas of the indicated genotypes and analysed at 
P2. GFP (green) marks transfected cells and sections were also stained for 
nuclei (DAPI, blue) and apoptosis (TUNEL, red). Arrows show examples of 
GFP+ TUNEL+ cells. b, Quantification of GFP+ TUNEL+ cells as a fraction 
of all GFP+ cells in clones of the indicated genotype. Data are mean ± s.d. 
(n = 3). Double asterisk, P < 0.01 relative to wild type; Student’s t-test. 
c, Insufficient or excess E2f activates p53, but by distinct mechanisms. Red, 
high/super-active; grey, low/inactive. d, An integrated E2f–Myc–Sirt1 
network promotes division and survival in retinal progenitors. Green and 
grey indicate active or inactive factors/functions, respectively. 

Activating E2fs are critical for division in flies and mammalian 
fibroblasts, where they counteract repressive E2f or p53, respectively*’. 
We provide the first in vivo evidence that E2f1–3 are not indispensable 
in every context. Mycn drives E2f-independent division, maintaining 
expression of E2f targets and preventing p53-mediated Cdkn1a induc- 
tion. In vitro overexpression studies suggested that Myc overcomes E2f 
repression'”'* but did not reveal whether normal Myc levels substitute 
for activating E2fs in vivo. Our genetic strategies reveal that physio- 
logical levels compensate for the activating E2fs cell autonomously. 
Other studies highlight redundancy among related cell cycle regula- 
tors*®. Our results indicate redundancy between unrelated cell cycle 
regulators. 

We also describe an unexpected pro-survival activity for E2f1-3 in 
vivo. Loss of any two was harmless, but removing all three impaired 
progenitor survival. Thus, activating E2fs have interchangeable pro- 
survival roles—not only E2f1 and not only in irradiated cells”!'”. We 
define the first direct mechanism linking activating E2fs to mammalian 
cell survival through Sirt1 and p53. E2f1–3 redundantly regulate 
other deacetylases (Fig. 3d), which may also inhibit p53. 
Integrating the Sirt1–p53 and Mycn data exposes a network that 
coordinates progenitor proliferation and survival (Fig. 4d). The 
novel E2f–Sirt1–p53 axis could also promote survival in p53+ tumour 
cells. Irrespective, its importance for survival in normal cells 
underscores the need for care in applying E2f inhibitors to treat human 
disease. 


METHODS SUMMARY 


For reactive gliosis, 1.0 µl of 2 mM domoic acid and 7 mM ouabain plus 1 mM
BrdU was injected into the eyes of P16 ketamine/xylazine anaesthetized mice”. 
For resveratrol, timed pregnant female mice were injected daily with 4 mg kg−1
of body weight (catalogue number 60512A, AKSci) or 0.1% DMSO intraperito- 
neally from E16. Virus injection and electroporation were as described’’. 
Fixation, antibody labelling, TUNEL and quantification of sections, or dissoci- 
ation and quantification of cells, and RNA and protein analysis were essentially 
as described'*”*. The PALM microlaser system was used for laser capture micro- 
dissection (LCM) following the manufacturer’s recommendations. For flow 
cytometry, E2f3loxP/loxP and E2f1−/− E2f2−/− E2f3loxP/loxP fibroblasts were infected with
MXIE or MXIRE-Cre retrovirus. E14 retinas were dissected and dissociated.
GFP+ cells were sorted using a FACSAria (Becton Dickinson). For cell cycle
analysis, cells were fixed in ice-cold 80% ethanol, counterstained with propidium 
iodide and analysed with a BD FACSCalibur system. Data were collected using 
CELLFIT software.

Full Methods and any associated references are available in the online version of
the paper at www.nature.com/nature.

METHODS

Mice, genotyping and resveratrol injection. Mice were treated according to 
institutional and national guidelines. α-Cre (P. Gruss)”, ROSA26R (Jackson
Laboratories)*', E2f1−/− mice (M. Greenberg) and E2f2−/−, E2f3loxP/loxP (ref. 32),
Trp53loxP/loxP (ref. 33), MycnloxP/loxP (ref. 34) and Pde6b−/− (referred to as rd1, Jackson
Laboratories) mice were maintained on a mixed (NMRI × C57/Bl × FVB/N × 129sv)
background. Genotyping was performed according to published
protocols. For resveratrol assays, timed pregnant female mice were injected daily 
with 4 mg kg−1 of body weight resveratrol (catalogue number 60512A, AKSci) or
0.1% DMSO intraperitoneally from E16. 

Immunofluorescence. Eyes were fixed in 4% paraformaldehyde for 1h at 4 °C, 
embedded in OCT (TissueTek 4583), frozen on dry ice and cut into 12 µm
sections on Superfrost slides. BrdU (100 µg g−1 body weight) was injected
intraperitoneally into mice 2 h before they were killed. BrdU+ cells were detected
using a biotin-conjugated sheep polyclonal antibody (1:500, Maine 
Biotechnology Services). Other antibodies were CRALBP (J. Saari), cyclin D3 
(Santa Cruz, SC-6283), E2f3 (Upstate, 05-551), GFP (Molecular Probe, A11122), 
GFAP (Sigma, G9269), Ki67 (BD Biosciences Pharmingen, 550609), Ki67 
(Neomarkers, RM-9106S), p27Kip1 (Santa Cruz, SC-528), Trp53 (NCL-p53-
CM5p; Novocastra) and phospho-histone H3 (Upstate Biotechnology, 
06-570). TUNEL and antigen retrieval were performed as described’. For double
labelling of BrdU and other markers, sections were first stained for the other 
marker, and fixed again with 70% ethanol/10% acetic acid for 5 min. Sections 
were washed, treated with 2 N HCl, and stained for BrdU. Primary antibodies 
were visualized using goat anti-mouse Alexa-488 or Alexa-568, goat anti-rabbit 
Alexa-488 or Alexa-568, donkey anti-goat Alexa-568, Streptavidin Alexa-488 or 
Alexa-568 (1:1,000; Molecular Probes) and Streptavidin-HRP-DAB. Nuclei were
counterstained with 4,6-diamidino-2-phenylindole (DAPI; Sigma). Labelled
cells were visualized using a Zeiss Axioplan-2 microscope with Plan Neofluar
objectives and images captured with a Zeiss AxioCam camera. For double-labelled
samples, confocal images were obtained with a Zeiss LSM 5.0 Laser
Scanning microscope. For quantification of stained sections, the retina was
separated into bins*. Measurements were performed with AxioVision software.
Quantification used horizontal sections containing the optic nerve. At least three 
sections per eye and three eyes from different litters were counted. Statistical 
analysis used the Student’s t-test. P values are based on two-sided hypothesis 
testing. 
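
The two-sided Student's t-test named above can be sketched as follows. This is only a minimal illustration, assuming the equal-variance (pooled) form of the test; the function names, the numerical integration of the t density and any input values are choices made here for illustration and are not taken from the paper.

```python
import math
from statistics import mean, variance


def t_pdf(x, df):
    """Density of Student's t distribution with df degrees of freedom."""
    coef = math.gamma((df + 1) / 2) / (math.sqrt(df * math.pi) * math.gamma(df / 2))
    return coef * (1 + x * x / df) ** (-(df + 1) / 2)


def two_sided_p(t, df, steps=2000):
    """Two-sided P value: 1 minus the t density integrated over [-|t|, |t|].

    The integral uses Simpson's rule, so steps must be even.
    """
    a = abs(t)
    if a == 0:
        return 1.0
    h = 2 * a / steps
    total = t_pdf(-a, df) + t_pdf(a, df)
    for i in range(1, steps):
        total += (4 if i % 2 else 2) * t_pdf(-a + i * h, df)
    return 1 - total * h / 3


def students_t_test(x, y):
    """Pooled-variance t statistic, degrees of freedom and two-sided P value."""
    nx, ny = len(x), len(y)
    df = nx + ny - 2
    pooled = ((nx - 1) * variance(x) + (ny - 1) * variance(y)) / df
    t = (mean(x) - mean(y)) / math.sqrt(pooled * (1 / nx + 1 / ny))
    return t, df, two_sided_p(t, df)
```

With three eyes per genotype, as in the counting scheme described above, the test has four degrees of freedom, so only large differences between genotypes reach conventional significance.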

Reactive gliosis. 1.0 µl of 2 mM domoic acid and 7 mM ouabain plus 1 mM
BrdU was injected into the eyes of ketamine/xylazine anaesthetized P16 mice”’.

Laser capture microdissection. Retinal cells were dissected from 12 µm sections
using the PALM microlaser system, catapulted into 10 µl buffer (1 mM EDTA;
20 mM Tris (pH 8)) containing 2 mg ml−1 proteinase K, incubated at 55 °C
overnight, and proteinase K inactivated at 99 °C for 10 min. 5 µl was used for
PCR. 

Flow cytometry. E2f3loxP/loxP and E2f1−/− E2f2−/− E2f3loxP/loxP mouse fibroblasts were
infected with MXIE or MXIRE-Cre retrovirus. E14 retinas were dissected and
dissociated. GFP+ cells were sorted using a FACSAria (Becton Dickinson). For
cell cycle analysis, cells were fixed in ice-cold 80% ethanol, counterstained with 
propidium iodide and analysed with a BD FACSCalibur system. Data were 
collected using CELLFIT software. 

RT-PCR and western blotting. Total RNA from fibroblast cells, peripheral 
retina or GFP-sorted cells was prepared using the RNeasy Mini kit (Qiagen), 
and treated with DNA-free (Ambion) according to the manufacturer’s instructions.
First-strand cDNA was synthesized from 0.2-0.5 µg using SuperScript II
(Invitrogen); primers are listed in Supplementary Table 1. An Applied
Biosystems PRISM 7900HT and SYBR Green PCR master mix were used for
real-time PCR. Tests were run in duplicate on three biological samples. Values
were normalized to β-actin.
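
The normalization step can be illustrated with a short sketch. The text states only that values were normalized to β-actin; the 2^(-ΔCt) form used below is a common convention assumed here, and the function names and Ct values are hypothetical.

```python
from statistics import mean


def relative_expression(target_ct, reference_ct):
    """Target expression relative to a reference gene: 2 ** -(Ct_target - Ct_reference).

    Each argument is a list of technical-replicate Ct values (duplicates here),
    which are averaged before the difference is taken.
    """
    delta_ct = mean(target_ct) - mean(reference_ct)
    return 2 ** (-delta_ct)


def summarize(samples):
    """Mean relative expression across biological samples.

    samples: list of (target_ct_replicates, reference_ct_replicates) pairs,
    one pair per biological sample (three in the design described above).
    """
    return mean(relative_expression(t, r) for t, r in samples)
```

Averaging the technical duplicates before taking the difference keeps each biological sample as a single data point.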

For western blots, peripheral mouse retinas were homogenized with a 30 gauge
needle 5-10 times in 1× cell lysis buffer (Cell Signaling) with 0.1 mM PMSF,
1 µg ml−1 aprotinin, 1 µg ml−1 leupeptin, 2 mM NaN, 10 µM trichostatin A and
5 mM niacinamide. Proteins were separated by 12% SDS-PAGE, transferred
to nitrocellulose and analysed by Li-Cor system (Lincoln) with antibodies
against p53 (NCL-p53-CM5p; Novocastra), acetylated p53 (ab61241; Abcam),
phosphorylated p53 (9286S; Cell Signaling), Sirt1 (07-131; Upstate), p21Cip1 (M-19;
Santa Cruz), p19Arf (NB200-106; Novus Biologicals), Noxa (IMG451;
Imgenex), HDAC1 (H-51; Santa Cruz), HDAC3 (H-99; Santa Cruz) and β-actin
(A5441, Sigma).

Retroviral injection and in vivo electroporation. Cre recombinase cDNA (L. 
van Parijs) was cloned into the BglII site of MXIE and retrovirus generated in 
Phoenix-eco cells as described’’. Recombinase activity was confirmed in 
ROSA26R mice (Supplementary Fig. 5) and by PCR of LCM-dissected single cells 
(M.P. and R.B., unpublished data). P0 pups were anaesthetized on ice. 2 µl of retrovirus
was injected into the subretinal space of the right (MXIE) or left eye (MXIE-Cre)
through a small limbal incision. P21 retinas were stained for GFP. Note that if an
E2f1−/− E2f2−/− E2f3−/− progenitor dies after introduction of Cre at P0, then by
definition it will not generate any post-mitotic cells and thus will not generate a
clone that can be counted at P21. Only the E2f1−/− E2f2−/− E2f3−/− progenitors
that survive contribute to clones that are counted at P21. This assay cannot
accurately estimate the fraction of progenitors that die, because
the number of clones varies with the technical variance of the
injection technique. The assay measures the number of cells per clone rather than
the number of clones. That is, it measures the ability of surviving progenitors to
divide (clone size), and not the degree of progenitor survival.
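
The distinction between the two readouts can be made concrete with a small sketch. The counts below are hypothetical and the function name is introduced here for illustration only; the point is that injection efficiency changes the number of clones but not the cells per clone.

```python
from statistics import mean


def clone_summary(clone_sizes):
    """Return the two readouts discussed above for one injected eye.

    clone_sizes: number of GFP-positive cells counted in each clone at P21.
    """
    return {"clones": len(clone_sizes), "cells_per_clone": mean(clone_sizes)}


# Hypothetical counts: the second eye received a more efficient injection, so it
# holds twice as many clones, but each surviving progenitor divided just as often.
eye_a = clone_summary([2, 4, 6])
eye_b = clone_summary([2, 4, 6, 2, 4, 6])
```

Here the clone count differs between the two eyes while the mean clone size is the same, which is why only the latter is interpreted.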

For electroporation, 1 µl of DNA (1-4 µg µl−1) was injected into the subretinal
space of P0 pups using a Hamilton syringe, and square electric pulses (80 V; five
50-ms pulses with 950-ms intervals) were applied with tweezer-type electrodes (CUY21
EDIT Electroporator).

Selection of E2f targets. E2f targets for qRT-PCR analysis were selected from
primary papers and reviews, focusing mainly on chromatin immunoprecipita- 
tion studies!!?!??>?>!, Their interaction with and/or regulation by Myc family 
proteins (summarized in Supplementary Fig. 7) was then based on surveying 
both the literature’ and a database of >1,600 Myc targets (http://www.myc-cancer-gene.org/)**.

E2f1-3 switch from activators in progenitor cells to
repressors in differentiating cells


Jean-Leon Chong, Pamela L. Wenzel, M. Teresa Saenz-Robles, Vivek Nair, Antoney Ferrey,
John P. Hagan, Yorman M. Gomez, Nidhi Sharma, Hui-Zi Chen, Madhu Ouseph,
Shu-Huei Wang, Prashant Trikha, Brian Culp, Louise Mezache, Douglas J. Winton, Owen J. Sansom,
Danian Chen, Rod Bremner, Paul G. Cantalupo, Michael L. Robinson, James M. Pipas & Gustavo Leone


In the established model of mammalian cell cycle control, the
retinoblastoma protein (Rb) functions to restrict cells from entering
S phase by binding and sequestering E2f activators (E2f1, E2f2
and E2f3), which are invariably portrayed as the ultimate effectors
of a transcriptional program that commits cells to enter and progress
through S phase’”. Using a panel of tissue-specific cre-transgenic
mice and conditional E2f alleles we examined the effects of E2f1, 
E2f2 and E2f3 triple deficiency in murine embryonic stem cells, 
embryos and small intestines. We show that in normal dividing 
progenitor cells E2f1-3 function as transcriptional activators, but 
contrary to the current view, are dispensable for cell division and 
instead are necessary for cell survival. In differentiating cells E2f1-3 
function in a complex with Rb as repressors to silence E2f targets 
and facilitate exit from the cell cycle. The inactivation of Rb in 
differentiating cells resulted in a switch of E2f1-3 from repressors
to activators, leading to the superactivation of E2f responsive targets 
and ectopic cell divisions. Loss of E2f1-3 completely suppressed 
these phenotypes caused by Rb deficiency. This work contextualizes 
the activator versus repressor functions of E2f1-3 in vivo, revealing 
distinct roles in dividing versus differentiating cells and in normal 
versus cancer-like cell cycles. 

E2fs function as transcription factors, with E2f1-3 as activators
and E2f4-8 as repressors *. Although it is a maxim of mammalian
cell cycle regulation that the E2f1-3 activator subclass is required for
cell proliferation, the evidence for this is based almost exclusively on 
in vitro studies using cells derived from murine and human tissues or 
on the in vivo analysis of Rb mutant mice'*. Other experiments, 
however, suggest that these E2fs can also function as repressors in 
complex with Rb’”’, yet the relative contribution of activation versus 
repression and the physiological contexts in which these contrary E2f 
functions are used remain unclear. 

To explore the functions of the E2f activator subclass, we derived
E2f1−/− E2f2−/− E2f3loxP/− embryonic stem (ES) cells (Supplementary
Fig. 1a, b) and compared the consequences of inactivating the conditional
E2f3loxP allele in ES cells and E2f1−/− E2f2−/− E2f3loxP/loxP
mouse embryonic fibroblasts (MEFs). The expression of E2f1, E2f2
and E2f3 in wild-type ES cells was generally higher than in MEFs
and the loading of E2f3 protein on classic E2f target promoters was
comparable between the two proliferating cell types (Supplementary
Fig. 2a-c). Consistent with previous observations, the ablation
of E2f1-3 in MEFs with standard cre-expressing vectors led to the
induction of p53 activity, the loading of E2f4-p130 repressor complexes
on E2f target promoters and a marked decrease in E2f target
expression (Fig. 1a and Supplementary Fig. 3a-c)*’. Consequently,
triply deficient MEFs underwent a complete cell cycle arrest
(Fig. 1b)*’. In contrast, E2f1−/− E2f2−/− E2f3−/− ES cells failed to
activate p53 or form E2f4-p130 repressive complexes, and as a result,
E2f target expression was unaffected and cells proliferated as well
as E2f1−/− E2f2−/− E2f3loxP/− double knockout control cells (Fig. 1b
and Supplementary Fig. 3a-c).

We then evaluated whether triply-deficient ES cells could proliferate 
in vivo. Subcutaneous injection of E2f1−/− E2f2−/− E2f3−/− ES cells into
athymic nude mice yielded efficient teratoma formation, producing
mesoderm, endoderm and ectoderm at a rate similar to E2f1−/−
E2f2−/− E2f3loxP/− double knockout ES cell lines (Fig. 1c and Supplementary
Fig. 4a, b). Moreover, from E2f1+/− E2f2−/− E2f3+/− intercrosses
we recovered the expected number of live E2f1−/−
E2f2−/− E2f3−/− triple knockout embryos as late as embryonic day
(E)9.5, but none was recovered past E11.5 (Fig. 1d and data not shown).
The live E9.5 E2f1−/− E2f2−/− E2f3−/− embryos appeared morphologically
normal by gross and histological examination (Fig. 1e and data not
shown). Although cell proliferation was normal in most tissues, there
was evidence of decreased proliferation and increased apoptosis in the
myocardium and the first branchial arch of E2f1−/− E2f2−/− E2f3−/−
embryos (Supplementary Fig. 5a-d). These latter observations are consistent
with heart defects found in E2f3 singly-deleted adult mice’’.

To explore whether E2f1-3 might have cell-cycle-related functions
in tissues that arise later in embryonic and postnatal development, we 
exploited the highly organized cellular architecture of the small 
intestine. Maintenance of structural and functional integrity of the 
small intestine requires continuous epithelial regeneration’. 
Intestinal stem cells are housed at the base of crypts of Lieberkühn
and give rise to transit-amplifying cells. As these cells migrate up from 
the base and into the finger-like extensions called villi, they exit the 
cell cycle and differentiate’. Western blot assays showed that E2f1, 
E2f2 and both isoforms of E2f3 (E2f3a and E2f3b) are expressed in the 
crypt and villus (Supplementary Fig. 6). We used Ah-cre mice’ to 
ablate E2f1-3 in the small intestine in utero or in adult mice
(Ah-cre E2f1−/− E2f2−/− E2f3loxP/loxP triple knockout). Induction of
Ah-cre expression by intraperitoneal injection of β-naphthoflavone
(β-NF) led to the efficient deletion of E2f3loxP in crypt stem cells
and transit-amplifying cells by 1 day after injection, and in the entire

Figure 1 | Cell proliferation in the absence of E2f1-3. a, Expression of E2f-regulated
genes was measured by real-time RT-PCR in proliferating ES and
MEF cells with the indicated genotypes (primer information is provided in
Supplementary Fig. 19). DKO, E2f1−/− E2f2−/− E2f3loxP/− (double knockout);
TKO, E2f1−/− E2f2−/− E2f3−/− (triple knockout). Error bars represent
standard deviation from samples analysed in triplicate. b, Growth curves of
two sets of E2f1−/− E2f2−/− E2f3loxP/− (DKO) and E2f1−/− E2f2−/− E2f3−/−
(TKO) ES cell clones (A and B; right two panels), and control-vector treated
E2f1−/− E2f2−/− E2f3loxP/loxP (DKO, left panel) and cre-vector treated
E2f1−/− E2f2−/− E2f3loxP/loxP MEFs (TKO, left panel). c, Double knockout and
triple knockout ES cells were injected underneath the skin of athymic nude
mice and teratomas were harvested, sectioned and stained with haematoxylin
and eosin. Representative tissues of double knockout and triple knockout
teratomas include muscle (mesoderm), respiratory epithelium (endoderm),
skin and neural cells (ectoderm). d, Embryos derived from intercrosses
between E2f1+/− E2f2−/− E2f3+/− mice were collected at various time points
during pregnancies. Shown are the total number of embryos collected, with
the number of dead embryos (round brackets) and total number of embryos
expected (square brackets) also indicated. e, Representative E9.5 embryos
were photographed immediately upon collection.


intestinal epithelium within 3-4 days (crypt and villus; Supplementary
Fig. 7a-c). Loss of E2f1-3 did not result in a compensatory
increase of other E2f family members, except for a modest increase
in E2f8 (Supplementary Fig. 7d). Whether E2f3loxP was deleted in
utero at E15.5 or in the adult at 2 months of age, the architecture of
Ah-cre E2f1−/− E2f2−/− E2f3loxP/loxP small intestines remained relatively
intact and animals were asymptomatic for 90 days after β-NF
administration (Fig. 2a and Supplementary Fig. 8a, b). Cell-type-specific
marker analysis demonstrated that all differentiated epithelial
cell types were appropriately represented in β-NF treated
Ah-cre E2f1−/− E2f2−/− E2f3loxP/loxP (E2f1-3-deficient) small intestines
(Fig. 2b and Supplementary Fig. 9). Remarkably, cell proliferation
was identical in E2f1-3-deficient and control intestines (Fig. 2c);
however, we noted a marked increase in the phosphorylation of
H2AX (γ-H2AX) and ataxia telangiectasia mutated (P-ATM) proteins in
E2f1-3-deficient crypts and villi (Fig. 2d, e and Supplementary
Fig. 10a). A parallel analysis of retinal’® and lens (P.L.W., unpublished
observations) progenitors also revealed increased γ-H2AX staining in
E2f1-3-deficient samples (Supplementary Fig. 10b, c). Together, these
observations suggest that counter to current dogma, E2f1-3 are dispensable
for the proliferation of ES cells and their mesodermal, endodermal
and ectodermal derivatives, and for the proliferation of cells in
at least some adult tissues.

Figure 2 | Apoptosis of crypt intestinal cells in the absence of E2f1, E2f2
and E2f3. a, Haematoxylin-and-eosin-stained sections from
E2f1−/− E2f2−/− E2f3loxP/loxP (control) and
Ah-cre E2f1−/− E2f2−/− E2f3loxP/loxP (Ah-cre) intestines after 90 days of β-NF
administration. b, Analysis of cell differentiation in control and Ah-cre small
intestines. Goblet cells were identified by Alcian blue staining (arrows point
to positive-stained goblet cells); absorptive cells were identified by anti-fatty
acid binding protein (FABP, green) antibodies; 4,6-diamidino-2-phenylindole
(DAPI; blue) was used for staining nuclei. c, BrdU (brown) and
phosphorylated histone H3 (P-H3, red) immunohistochemical staining was
performed on small intestine sections from β-NF-injected control and Ah-cre
mice. Quantification of BrdU- and phosphorylated-histone-H3-positive
cells in crypts and villi is shown in the bottom panels; n = 3 different animals
of each indicated genotype were analysed; error bars indicate standard
deviation. d, Immunohistochemical staining for γ-H2AX and P-ATM in
control and Ah-cre intestinal crypts and villi. The orange dotted line outlines
the luminal side of the villus; the white dotted line outlines the outer side of
the villus. DAPI (blue) was used for staining nuclei. e, Examination of
γ-H2AX and P-ATM in cell extracts from control and Ah-cre intestinal
crypts and villi by western blot assays. f, Sections of small intestines from
β-NF-injected control and Ah-cre mice were processed for TUNEL (brown)
and cleaved caspase 3 (red) assays. DAPI (blue) or haematoxylin was used
for staining nuclei. Arrows point to TUNEL-positive or cleaved caspase-3-positive
cells. Quantification of TUNEL and cleaved caspase-3-positive cells
in crypts and villi is shown in the bottom panels; n = 3 different animals
of each indicated genotype were analysed; error bars indicate standard
deviation.


Close examination of haematoxylin-and-eosin-stained slides
revealed increased numbers of pyknotic nuclei in E2f1-3-deficient
crypts (data not shown). TdT-mediated dUTP nick end labelling
(TUNEL) and cleaved caspase-3 assays confirmed the presence of
apoptotic cells in crypts of E2f1-3-deficient intestines (Fig. 2f). We
also observed increased p53 immunoreactivity in E2f1-3-deficient
crypts (Supplementary Fig. 11a), which was reminiscent of previous
work showing exquisite sensitivity of this cellular compartment to
oncogene- and radiation-induced p53 responses’®. Although p53 was
elevated in E2f1-3-deficient crypts, we failed to detect any significant
increase in the expression of p53-responsive genes, and moreover, the
conditional ablation of p53 (β-NF treated Ah-cre E2f1−/−
E2f2−/− E2f3loxP/loxP p53loxP/loxP) did not suppress the apoptosis
caused by E2f1-3 deficiency (Supplementary Fig. 11b, c). Together,
these observations suggest that E2f1-3 are dispensable for cell division
in the adult and that at least in the small intestine they function
in a p53-independent manner to maintain DNA integrity and cell
survival.

To understand the underlying mechanism for these unexpected
results, we isolated crypt and villus cell populations from control and
E2f1-3-deficient small intestines and analysed global gene expression
profiles. Sample preparation and processing of the Affymetrix oligo-arrays
are described in the Methods section. We used an unbiased
method similar to Gene Set Enrichment Analysis to identify genes
that were differentially expressed’’. Two variables contributed to the
observed gene expression changes: cell compartment (crypt versus
villus) and genotype (E2f1-3-deficient versus control). The cell compartment
analysis compared gene expression in crypts and villi of the
same genotype (Fig. 3a). For control small intestines, this revealed
that among the ~45,000 genes queried, 1,207 genes were upregulated
and 2,363 genes were downregulated as progenitor cells in the crypt
migrated up into the villus and exited the cell cycle (>1.5-fold,
P < 0.0001; Supplementary Fig. 12a and Supplementary Table 1).
As expected, the expression of most known E2f targets, as defined
by previous gene expression’®, reporter and chromatin immunoprecipitation
assays’” (Supplementary Fig. 12b), was markedly higher in
control crypts than in associated villi (Fig. 3a, left panel), consistent
with the proliferative status of crypts. For E2f1-3-deficient small
intestines, the expression of E2f targets in crypts was only marginally
higher than in their associated villi (Fig. 3a, right panel), suggesting
that expression of these genes was either reduced in crypts, elevated
in villi, or both.
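
The cut-offs quoted above (>1.5-fold, P < 0.0001) can be illustrated with a simple threshold filter. This is only a sketch of how such cut-offs partition genes, not the enrichment-style method the authors used; the function name, the input layout and the assumption of linear-scale expression values are all introduced here for illustration.

```python
def classify_genes(rows, fold=1.5, alpha=1e-4):
    """Split genes into up- and downregulated sets for a villus-versus-crypt comparison.

    rows: iterable of (gene, crypt_level, villus_level, p_value) tuples,
    with expression levels on a linear (not log) scale.
    """
    up, down = [], []
    for gene, crypt, villus, p in rows:
        if p >= alpha:
            continue  # not significant at the chosen P cut-off
        ratio = villus / crypt
        if ratio > fold:
            up.append(gene)    # higher in villus than in crypt
        elif ratio < 1 / fold:
            down.append(gene)  # lower in villus than in crypt
    return up, down
```

A gene must pass both the fold-change and the P-value criteria to be counted, so a large but noisy change is excluded just as a small but consistent one is.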

The genotype analysis compared gene expression in E2f1-3-deficient
versus control samples of the same cell compartment.
This comparison revealed a modest but significant downregulation
of E2f targets in progenitor cells of E2f1-3-deficient crypts, which
included many but not all known classic targets such as Cdc6, Ccna2,
Ccne2, Top2 and Hmgb2 (Fig. 3b, c, left panels and Supplementary
Fig. 13a). We suspect that continued proliferation of E2f1-3-deficient
progenitors in the small intestine when E2f targets are
limiting probably contributes to replicative stress, DNA damage


Figure 3 | Repression of E2f target genes in 
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immunoprecipitation assays of cell extracts 
prepared from control villi and crypts. 
Immunoprecipitations (IP) used anti-E2f3 
antibody or IgG. Anti-Rb antibody was used to 
probe western blot (WB; top panel). The 
specificity of the anti-E2f3 antibody was 
evaluated in intestinal lysates derived from β-NF
treated Ah-cre E2f1−/− E2f2−/− E2f3loxP/loxP
(3−/−), E2f3a−/− (a−/−), E2f3b−/− (b−/−) and
E2f3+/+ (3+/+) mice. Anti-Rb antibody was used
to probe western blot (WB; bottom panel).

and the observed increase in γ-H2AX labelling in these cells. Whether
these aberrant processes are linked to the death of E2f1-3-deficient
progenitor cells remains to be evaluated rigorously. The genotype
comparison also revealed a remarkable upregulation of a large number
of E2f targets in differentiated cells of the E2f1-3-deficient villus
(Fig. 3b, c, right panels, and Supplementary Fig. 13a). Western blot
assays and immunofluorescence staining showed that the accumulation
of two of these E2f target gene products, Mcm3 and Pcna, was
widespread throughout the E2f1-3-deficient villus (Fig. 3d and
Supplementary Fig. 13b). Similarly, there was increased expression
of E2f targets in differentiated E2f1-3-deficient cells of the retina and
lens (Supplementary Fig. 13c; P.L.W., unpublished observations), suggesting
a general role for E2f1-3 in transcriptional repression in post-mitotic
cells in vivo. Chromatin immunoprecipitation (ChIP) assays
using villus-enriched lysates derived from control and E2f1-3-deficient
small intestines showed that E2f3 occupies E2f binding sites on classic
E2f target promoters (Fig. 3e). Importantly, co-immunoprecipitation
assays using intestinal epithelial cells derived from E2f3−/−, E2f3a−/−
and E2f3b−/− villi showed that both E2f3a/b isoforms” ” participate in
a complex with the Rb protein (Fig. 3f). Consistent with this, Rb was
found to be hypophosphorylated in the villus (Supplementary Fig. 14a).
Together, these data indicate that E2f1-3 act as transcriptional activators
in dividing progenitors, and as repressors (in complex with Rb) in
differentiating cells of the small intestine.

The observation that E2f1-3 repress E2f targets and are dispensable
for cell proliferation seems to contradict previous findings from
the analysis of Rb/E2f double knockout animals'**. Therefore, to
explore thoroughly the mechanistic relationship between Rb and
E2f1-3, we used the small intestine as an in vivo system where results
could be uniformly compared across different genetic configurations.
The Ah-cre-mediated inactivation of Rb in utero or in adult
mice resulted in increased proliferation of cells in the villus compartment
but not in the crypt (Fig. 4a, b and Supplementary Fig. 14b-e),
indicating that Rb-deleted transit-amplifying cells failed to exit the
cell cycle appropriately. There was, however, no concomitant increase
in apoptosis or defect in cell differentiation (Supplementary Fig. 15a, b),
and as a result, Rb-deficient villi appeared uniformly hyperplastic. The
combined ablation of E2f1, E2f2 and E2f3 completely suppressed the
unscheduled proliferation and hyperplasia caused by Rb deficiency
(β-NF treated Ah-cre E2f1−/− E2f2−/− E2f3loxP/loxP RbloxP/loxP quadruple
knockout; Fig. 4a, b and Supplementary Fig. 16a). Notably, the basal
levels of proliferation in quadruple knockout crypts were indistinguishable
from control or E2f1-3-deficient samples (Supplementary Fig. 16b),
consistent with the rather normal development of E2f1-3-deficient small
intestines containing an intact Rb gene.

The selective requirement for E2f1-3 in the proliferation of Rb-deficient
cells provided an opportunity to dissect possible cancer-specific
mechanisms of E2f in cell cycle control. We therefore
compared global gene expression programs in β-NF treated control,
Ah-cre RbloxP/loxP (Rb-deficient), Ah-cre E2f1−/− E2f2−/− E2f3loxP/loxP
(E2f1-3-deficient) and Ah-cre E2f1−/− E2f2−/− E2f3loxP/loxP RbloxP/loxP
(Rb/E2f1-3-deficient) intestinal epithelia. Several important insights
came from this analysis. First, there were expansive gene expression
differences between control and Ah-cre RbloxP/loxP villi (1,290
upregulated and 487 downregulated genes; Fig. 4c and Supplementary
Table 2), but relatively minor differences in their associated crypts
(Supplementary Fig. 17a, b). Gene Ontology algorithms” identified a
bias for differentially expressed genes involved in the regulation of transcription,
DNA metabolic processes and cell cycle (Supplementary
Table 3). Immunofluorescence and quantitative polymerase chain reaction
with reverse transcription (RT-PCR) assays confirmed the marked
accumulation of most E2f target genes in Rb-deficient villi (Supplementary
Fig. 17c, d). From these data we conclude that Rb is critical for the
repression of E2f targets at a time when progenitor cells commit to exit
the cell cycle and terminally differentiate. Second, hierarchical clustering
of all data sets showed that E2f1-3-deficient and Rb/E2f1-3-deficient
tissues clustered together in a separate group from control and Rb-

Figure 4 | E2f1-3 contribute to the ectopic cell proliferation caused by Rb-deficiency.
a, BrdU analysis was performed in β-NF treated wild type
(control), Ah-cre RbloxP/loxP (Rb knockout (KO)) and
Ah-cre E2f1−/− E2f2−/− E2f3loxP/loxP RbloxP/loxP (quadruple knockout (QKO))
small intestines. b, Quantification of BrdU incorporation; n = 3 different
animals of each indicated genotype were analysed; error bars indicate
standard deviation. c, Scatter plot analysis comparing differentially
expressed E2f target genes in β-NF treated control, Ah-cre RbloxP/loxP (Rb
KO), Ah-cre E2f1−/− E2f2−/− E2f3loxP/loxP (TKO) and
Ah-cre E2f1−/− E2f2−/− E2f3loxP/loxP RbloxP/loxP (QKO) villi; n = 3 for each of the
eight samples. Red dots indicate genes whose expression increased >1.5-fold
and blue dots indicate genes that decreased >1.5-fold. d, Quantitative
RT-PCR analysis of selected E2f target genes in β-NF treated control (ctrl),
Ah-cre RbloxP/loxP (Rb KO), Ah-cre E2f1−/− E2f2−/− E2f3loxP/loxP (TKO) and
Ah-cre E2f1−/− E2f2−/− E2f3loxP/loxP RbloxP/loxP (QKO) villi. The normal basal
level of E2f target expression is illustrated as a grey dotted line and the
threshold level of E2f target expression required for ectopic proliferation is
illustrated as a red dotted line. Error bars represent standard deviation from
samples analysed in triplicate.


deficient tissues (Supplementary Fig. 18), indicating that some functions
coordinated by E2f1-3 may be Rb-independent. Finally and most
importantly, the expression levels of E2f targets in E2f1-3-deficient
and Rb/E2f1-3-deficient villi were equivalent, and although higher than
in control villi, they were substantially lower than in Rb-deficient villi
(Fig. 4c). Quantitative RT-PCR assays confirmed the relative expression
of E2f targets to be: control < E2f1-3 deficient = Rb/E2f1-3 deficient
< Rb deficient villi (Fig. 4d). From these data, we conclude that the
supra-elevated expression of E2f targets observed in Rb-deficient villi is
due to both ‘derepression’ (lacking intact Rb-E2f1-3 repressor complexes)
and E2f1-3-mediated ‘hyper-activation’. Without E2f1-3, cells in
Rb/E2f1-3-deficient villi cannot hyper-activate E2f targets and thus do
not accumulate the levels of E2f targets needed
to undergo ectopic cell proliferation (this threshold level of expression
is illustrated as a red dotted line in Fig. 4d).
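
The logic of this conclusion can be written out as a toy model. The additive form, the numerical magnitudes and the threshold value below are hypothetical and chosen only to reproduce the qualitative ordering reported above; none of them is a measured quantity.

```python
def e2f_target_level(rb_intact, e2f_intact, basal=1.0, derepression=1.0, hyperactivation=3.0):
    """Toy additive model of E2f target expression in villus cells (arbitrary units)."""
    level = basal
    if not (rb_intact and e2f_intact):
        level += derepression      # no Rb-E2f1-3 repressor complex can form
    if e2f_intact and not rb_intact:
        level += hyperactivation   # free E2f1-3 activate their targets
    return level


THRESHOLD = 4.0  # hypothetical level needed for ectopic proliferation

levels = {
    "control": e2f_target_level(rb_intact=True, e2f_intact=True),
    "E2f1-3 deficient": e2f_target_level(rb_intact=True, e2f_intact=False),
    "Rb/E2f1-3 deficient": e2f_target_level(rb_intact=False, e2f_intact=False),
    "Rb deficient": e2f_target_level(rb_intact=False, e2f_intact=True),
}
ectopic = {genotype: level > THRESHOLD for genotype, level in levels.items()}
```

In this sketch only the Rb-deficient genotype crosses the threshold, because it is the only one in which derepression and hyper-activation add together.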

We provide overwhelming evidence showing that normal cell pro- 
liferation in mice can be maintained in the absence of activator E2fs. 
We conclude that E2f1—3, like G1 Cdks***°, are not as critical for 
normal cell proliferation in mammals as original studies implied**””~”°. 
However, not all is well in the absence of E2f1-3, as β-NF treated
Ah-cre E2f1−/− E2f2−/− E2f3loxP/loxP dividing progenitors in the small
intestine undergo apoptosis. A pro-survival role for E2f1-3 was also
evident in retinal progenitor cells of the mouse’’; however, in the retina
cell death was p53 dependent whereas in the small intestine it was p53
independent. Thus, the sensitivity of E2f1-3-deficient cells to p53
activation varies considerably across tissue types.

The findings presented here also expose dual functions for E2f1-3
in transcription activation and repression in vivo. In dividing progenitor
cells, when Rb is inactive (hyperphosphorylated), free E2f1-3
are used to optimally activate the expression of target genes. The
inability to do so in E2f1-3-deficient tissues still permits cells to
replicate their DNA and divide, but at the cost of increased DNA
damage and cell death. As cells commit to a terminally differentiated
fate, phosphorylated Rb is dephosphorylated and forms a physical
complex with E2f1-3 proteins. We propose that this is not just to
sequester E2f activators but rather, to form the first repressive complex
that is necessary to downregulate E2f targets and usher transit-amplifying
cells out of the cell cycle. Once cells exit the cell cycle,
other E2f repressor complexes accumulate, including p130-E2f4 and
p107-E2f4, to more permanently enforce the repression of E2f
targets. Given that inactivation of Rb, but not p107 or p130 (ref. 31),
induces ectopic cell divisions in the small intestine, we suggest that Rb
has a unique role in transit-amplifying cells that is dependent on its
ability to associate with E2f1-3. Maintenance of quiescence in terminally
differentiated cells of the villus, however, is a function that is
shared among all members of the Rb family’’. This work challenges
the current paradigm of cell cycle control and provides a unified
molecular view of how the dual functions of E2f1-3 in transcriptional
activation and repression are used in vivo to control normal versus Rb-mutant
or cancer cell cycles.


METHODS SUMMARY 


Mice (E2f1-/-, E2f2-/-, E2f3LoxP/LoxP, Ah-cre and Rb) used for the studies were in a mixed background (129SvEv, C57BL/6NTac and FVB/NTac). β-Naphthoflavone (Sigma; N3633-5G) was administered into 2-month-old Ah-cre mice three times within 24 h as described previously, and mice were harvested 7 or 90 days later. β-Naphthoflavone was also injected into pregnant female mice at 15.5 days post coitum for analysis of embryos at E18.5. Villus and crypt fractions were isolated as previously described. Three independent samples from each genetic group were used for gene expression analysis by Affymetrix microarray. Analysis of gene expression data was performed using BRB-ArrayTools developed by R. Simon and A. Peng Lam of the National Cancer Institute. Gene Ontologies were predicted by DAVID (Database for Annotation, Visualization and Integrated Discovery) Bioinformatics Resources at the National Institute of Allergy and Infectious Diseases, NIH. X-gal staining, real-time RT-PCR, 5-bromodeoxyuridine (BrdU), ChIP and TUNEL assays were performed as previously described. Primers for ChIP, real-time RT-PCR and genotyping are listed in Supplementary Fig. 19a, b. Antibodies used for western blot or immunohistochemical staining are listed in Supplementary Fig. 19c.

Mammalian SUMO E3-ligases PIAS1 and PIAS4
promote responses to DNA double-strand breaks


Yaron Galanty¹, Rimma Belotserkovskaya¹, Julia Coates¹, Sophie Polo¹, Kyle M. Miller¹ & Stephen P. Jackson¹


DNA double-strand breaks (DSBs) are highly cytotoxic lesions that are generated by ionizing radiation and various DNA-damaging chemicals. Following DSB formation, cells activate the DNA-damage response (DDR) protein kinases ATM, ATR and DNA-PK (also known as PRKDC). These then trigger histone H2AX (also known as H2AFX) phosphorylation and the accumulation of proteins such as MDC1, 53BP1 (also known as TP53BP1), BRCA1, CtIP (also known as RBBP8), RNF8 and RNF168/RIDDLIN into ionizing radiation-induced foci (IRIF) that amplify DSB signalling and promote DSB repair. Attachment of small ubiquitin-related modifier (SUMO) to target proteins controls diverse cellular functions. Here, we show that SUMO1, SUMO2 and SUMO3 accumulate at DSB sites in mammalian cells, with SUMO1 and SUMO2/3 accrual requiring the E3 ligase enzymes PIAS4 and PIAS1. We also establish that PIAS1 and PIAS4 are recruited to damage sites via mechanisms requiring their SAP domains, and are needed for the productive association of 53BP1, BRCA1 and RNF168 with such regions. Furthermore, we show that PIAS1 and PIAS4 promote DSB repair and confer ionizing radiation resistance. Finally, we establish that PIAS1 and PIAS4 are required for effective ubiquitin-adduct formation mediated by RNF8, RNF168 and BRCA1 at sites of DNA damage. These findings thus identify PIAS1 and PIAS4 as components of the DDR and reveal how protein recruitment to DSB sites is controlled by coordinated SUMOylation and ubiquitylation.

¹The Wellcome Trust and Cancer Research UK Gurdon Institute, and Department of Biochemistry, University of Cambridge, Tennis Court Road, Cambridge CB2 1QN, UK.

Figure 1 | SUMOs and UBC9 accumulate at DNA-damage sites by mechanisms requiring MDC1, 53BP1 and BRCA1. a, U2OS cells or U2OS cells transfected with HA-SUMO1 or HA-SUMO3 were irradiated (5 Gy; +IR) or mock-irradiated (-IR) and probed. b, As in a but with laser micro-irradiation. c, U2OS cells co-transfected with GFP-UBC9 and RFP-SUMO1 or RFP-SUMO2 were micro-irradiated and live cells imaged after 20 min. d, e, U2OS cells transfected with siRNAs were laser micro-irradiated and probed. For siRNA depletions, see Fig. 3e and Supplementary Fig. 10a.

Mammalian cells express SUMO1 and the highly related proteins SUMO2 and SUMO3 (SUMO2/3). These somewhat functionally redundant proteins are structurally related to ubiquitin and are covalently attached to target proteins by a SUMO-conjugation system consisting of an E1 activating enzyme (SAE1/SAE2), an E2 ligase (UBC9, also known as UBE2I) and various E3 ligases with differing target-protein specificities. Involvement of the SUMO pathway in aspects of the DDR was previously reported (for review, see ref. 5). Notably, we found that, whereas SUMO1 exhibited pan-nuclear staining in untreated human cells, four hours after ionizing radiation treatment it formed nuclear foci that largely co-localized with 53BP1, suggesting them to be IRIF (Fig. 1a). Similarly, transfected haemagglutinin (HA)-epitope-tagged SUMO1 and SUMO3 formed IRIF (Fig. 1a; SUMO2/3 foci that do not co-localize with 53BP1 presumably reflect SUMO conjugates in other structures, including PML bodies). Next, we employed laser micro-irradiation to induce DNA-damage tracts (laser-lines) in living cells. This showed that endogenous SUMO1 and SUMO2/3 (the antibody does not discriminate between these), together with HA-SUMO1 and HA-SUMO3, accumulated in laser-lines (Fig. 1b). Moreover, live imaging of cells containing green-fluorescent-protein (GFP)-tagged 53BP1 or red-fluorescent-protein (RFP)-tagged SUMO1, SUMO2 or SUMO3 showed that all exhibited similar recruitment kinetics: accrual being detectable five minutes after micro-irradiation, peaking in intensity at two to four hours and then gradually diminishing (Supplementary Figs 1a-c, 2a and b). Furthermore, we observed SUMO1 and SUMO2/3 accumulation with varying intensities in both G1 and S/G2 cells (Supplementary Fig. 2c). Consistent with SUMOylation actively occurring at damage sites, UBC9 (the only known SUMO E2) accumulated at damaged regions with similar kinetics to SUMO (Fig. 1c and Supplementary Figs 1b, 1d and 2d). Furthermore, we observed faint recruitment of the SUMO E1 component, SAE1, to laser-lines (data not shown), in accord with SAE1 recently being identified as a potential ATM/ATR target.

In line with SUMO accumulation in IRIF and laser-lines representing responses to DSBs, such accumulation was reduced when cells were pre-incubated with KU-55933, a specific ATM inhibitor (Supplementary Fig. 3a), whereas accumulation of SUMO1, and to a lesser extent SUMO2/3, was enhanced by depletion of CtIP or MMS21 (also known as NSMCE2), which promote DNA repair (Figs 1d, e and Supplementary Figs 4a, b; see Fig. 3e for CtIP depletion and Supplementary Fig. 10 for other depletions). Furthermore, we observed markedly reduced SUMO1 and SUMO2/3 accumulation at damaged sites in cells that were defective in RNF168 or had been treated with short-interfering RNAs (siRNAs) to deplete MDC1 or RNF8 (Figs 1d and 1e, and Supplementary Figs 3b and 3c). Because MDC1, RNF8 and RNF168 control the retention of 53BP1 and BRCA1 at DNA-damage sites, we tested whether depleting 53BP1 or BRCA1 affected SUMO accrual. Indeed, 53BP1 depletion impaired SUMO1 but not SUMO2/3 accumulation in laser-lines (Figs 1d and 1e). Conversely, BRCA1 depletion abolished SUMO2/3 but not SUMO1 accrual (Figs 1d and 1e). Collectively, these data suggested that DNA damage is channelled into 53BP1-SUMO1 or BRCA1-SUMO2/3 pathways.

The different accumulation requirements for SUMO1 and SUMO2/3 suggested that their conjugation might require different E3 ligases. By siRNA depletion of various SUMO E3 ligases (Supplementary Fig. 10), we found that most were not required for SUMO1 or SUMO2/3 accrual at DNA damage sites (Supplementary Figs 4a and 4b). Strikingly, however, depletion of the PIAS4 E3 ligase markedly reduced SUMO1 accrual on laser-lines (Figs 2a and 2b; note that MDC1 recruitment still occurred). Nevertheless, in certain cells, PIAS4 depletion also impaired SUMO2/3 (and 53BP1) accumulation (Fig. 2b, bottom panels), indicating that PIAS4 controls both SUMO1 and SUMO2/3 accrual. Accordingly, PIAS4 depletion impaired the accumulation of GFP-SUMO3 at laser-lines (data not shown). In parallel, we found that PIAS1 depletion markedly reduced SUMO2/3 accumulation at sites of DNA damage in all cells, but did not affect SUMO1 accrual (Figs 2a, 2b and Supplementary Figs 4a and 4b). Supporting a model in which PIAS4 and PIAS1 mediate SUMO conjugation at DSB sites, RFP-tagged PIAS4 and GFP-tagged PIAS1 were recruited to laser-lines with similar kinetics to SUMO and UBC9 (Fig. 2c and 2d). Furthermore, for both PIAS4 and PIAS1, recruitment required their N-terminal SAP domain (originally defined as a DNA/RNA binding motif) but was not impaired by mutations predicted to abolish their SUMO E3-ligase functions (Fig. 2d). When expressed alone, however, the SAP domains of PIAS1 and PIAS4 were not detectably recruited to laser-lines, revealing that additional parts of these proteins are required for effective recruitment (data not shown). Strikingly, PIAS4 depletion by either of two independent siRNA oligonucleotides, but not depletion of any other E3 enzyme tested, severely impaired 53BP1 accumulation in laser-lines and in IRIF (Figs 2a, b and Supplementary Figs 4a-d; demonstration of siRNA specificity is provided by use of a point-mutated siRNA-resistant PIAS4 construct in Supplementary Figs 5a and b). In accord with this, UBC9 depletion impaired 53BP1 accumulation, while histone H2AX phosphorylation (γ-H2AX) and MDC1 recruitment still ensued (Fig. 2a). Furthermore, fluorescence-recovery after photo-bleaching (FRAP) assays established that PIAS4 depletion significantly reduced the residence time of 53BP1 in laser-lines and increased the mobile fraction of 53BP1 molecules in these locations (Fig. 2e; representative images are shown in Supplementary Fig. 5d). By contrast, RFP-PIAS4 accrual in laser lines was not impaired by 53BP1 depletion (Supplementary Fig. 5c), implying that PIAS4 acts upstream of 53BP1.

Figure 2 | PIAS1 and PIAS4 are recruited to DNA-damage sites and mediate 53BP1 recruitment and SUMOylation. a, b, U2OS cells stably expressing GFP-MDC1 were treated, micro-irradiated and probed (Supplementary Figs 4c and 10a-c for quantifications and depletions, respectively). c, Cells expressing GFP-PIAS1 or RFP-PIAS4 were micro-irradiated and probed. d, Cells expressing GFP-PIAS1 or RFP-PIAS4 wild-type, ligase dead (LD), delta SAP (data not shown) or ligase-dead delta SAP (LDΔSAP) were micro-irradiated and imaged. e, Cells stably expressing GFP-53BP1 subjected to FRAP (n = 11 independent measurements; error bars = s.d.). f-h, U2OS cells stably expressing GFP-SUMO1 or GFP, MRE11 and Tubulin used as loading control (f, g), or HEK293 cells co-transfected with full-length (1-1972) HA-53BP1 and GFP-SUMO1 or GFP (h) were treated with or without IR (10 Gy). IB, Immunoblotting.

During the above studies, we noted that the ionizing radiation-induced shift in 53BP1 electrophoretic mobility on SDS-polyacrylamide gels was reduced by PIAS4 depletion (data not shown). Consistent with this mobility shift at least in part reflecting 53BP1 SUMOylation, the migration of endogenously expressed 53BP1 on SDS-PAGE was shifted yet further in cells expressing GFP-tagged SUMO1 (Fig. 2f); furthermore, this shift was diminished by PIAS4 depletion but not by PIAS1 depletion (Fig. 2g). To test directly for 53BP1 SUMOylation, we transiently co-expressed HA-tagged 53BP1 with GFP-SUMO1 or GFP in cells, then performed GFP immunoprecipitations. Western immunoblotting of resulting samples with an anti-HA antibody established that 53BP1 was indeed SUMOylated in an ionizing radiation-inducible manner (Fig. 2h), a conclusion supported by reciprocal immunoprecipitation-western experiments (Supplementary Fig. 6a) and by experiments with endogenous 53BP1 and SUMO1 (Supplementary Fig. 6b; this also showed that 53BP1 SUMOylation was reduced by depleting PIAS4 but not PIAS1). Studies with cells expressing 53BP1 truncations revealed that both the amino terminal (residues 1-1052) and carboxy terminal (1052-1972) regions of 53BP1 can be SUMOylated and suggested that C-terminal SUMOylation occurs between residues 1052 and 1710 (Supplementary Figs 6c-e). While these data indicated that DNA-damage-induced 53BP1 SUMOylation occurs, we note that this cannot account for all the PIAS-dependent SUMOylation signals observed in IRIF or laser-lines. Consequently, there must be additional DDR factors (some of which might have been identified in previous studies) that are targeted for DNA-damage-induced, PIAS-mediated SUMOylation.

Figure 3 | PIAS1 and PIAS4 promote BRCA1 accumulation and SUMOylation, RPA phosphorylation, and DSB repair. a, U2OS cells treated, micro-irradiated and probed as indicated; representative images with % of γ-H2AX-positive cells also positive for BRCA1; each image represents >200 γ-H2AX-positive cells in two independent experiments. b, U2OS cells stably expressing GFP-BRCA1 and Flag-BARD1 were subjected to FRAP; data from luciferase (Luc, n = 7 independent measurements), PIAS1 (n = 8) and PIAS4 (n = 11); error bars = s.d. c, Essentially as Fig. 2h, except cells were co-transfected with GFP-SUMO1, HA-BRCA1 and Flag-BARD1. IP, Immunoprecipitation. MRE11 (also known as MRE11A) was used as loading control. d, e, Extracts were prepared and analysed 2 h following mock (-) or 10 Gy ionizing radiation treatment. NBS1 (also known as NBN) was used as loading control. f-h, Effects of PIAS1/4 depletion on homologous recombination-mediated gene-conversion (f), NHEJ (g) and ionizing radiation sensitivity (h); error bars = s.d.; data accumulated over four independent experiments (in each of f-h).

In parallel work, we found that depletion of either PIAS1 or PIAS4 reduced the proportion of damaged (γ-H2AX-positive) cells displaying BRCA1 accumulation and decreased BRCA1 staining intensity in those cells still exhibiting BRCA1 accrual (Fig. 3a and Supplementary Fig. 7b; cells with weak BRCA1 staining in Fig. 3a were counted positive). By employing cDNA complementation studies, we established that BRCA1 accrual required the SAP domain and E3-ligase activity of PIAS1 (Supplementary Figs 7c-e). Furthermore, FRAP analyses revealed that PIAS1 or PIAS4 depletion reduced the residence time of GFP-BRCA1 at damaged sites and increased the mobile fraction of BRCA1 molecules (Fig. 3b; see representative images in Supplementary Fig. 7b). By using epitope-tagged SUMO2 and BRCA1 in immunoprecipitation-western studies, we also established that BRCA1 is SUMOylated and that this is enhanced upon ionizing radiation treatment (Fig. 3c). Accordingly, probing western blots of BRCA1 immunoprecipitates for SUMO2/3 revealed that ionizing radiation enhanced BRCA1 SUMOylation in a manner promoted by both PIAS1 and PIAS4 (Fig. 3d).

DSBs can be processed into single-stranded DNA that is bound by replication protein A (RPA) to promote ATR signalling and DSB repair by homologous recombination. Notably, RPA accumulation in laser-lines (whether normalized or not to cell cycle profiles in Supplementary Fig. 10e) was impaired by PIAS1 or PIAS4 depletion (Supplementary Figs 8a-c). Furthermore, phosphorylation of the 34 kDa subunit of RPA on Ser 4 and Ser 8 (pS4/pS8) in response to ionizing radiation or camptothecin treatment was diminished by PIAS4 depletion, whereas PIAS1 depletion impaired ionizing radiation-induced but not camptothecin-induced RPA phosphorylation (Fig. 3e; CtIP depletion also impaired RPA phosphorylation, as previously reported). Consistent with these findings and the involvement of BRCA1 and RPA in DNA repair by homologous recombination, PIAS1 or PIAS4 depletion reduced homologous recombination in a cell-based gene conversion assay (Fig. 3f). PIAS1 and PIAS4 depletion also impaired DSB repair by non-homologous end-joining (NHEJ) as assessed by a cell-based plasmid-integration assay (Fig. 3g) and resulted in ionizing radiation hypersensitivity (Fig. 3h).

Accumulation of 53BP1, BRCA1 and ubiquitin conjugates at DSB sites requires the ubiquitin E3 ligases, RNF8 and RNF168, which ubiquitylate histones H2A and H2AX. Furthermore, it has been reported that in both Caenorhabditis elegans and mammalian cells, ubiquitin-conjugate formation at DNA-damage sites requires BRCA1 E3-ubiquitin ligase activity, although other groups have reported the effect of BRCA1 depletion on ubiquitin accrual to be only minor. In our assays, we found that, as for BRCA1 depletion, PIAS1 or PIAS4 depletion dramatically impaired ubiquitin-conjugate accumulation (as detected by the FK2 antibody) in laser-lines, while GFP-RNF8 accumulation appeared normal (Figs 4a, b and Supplementary Fig. 9a). Furthermore, PIAS4 depletion but not PIAS1 depletion markedly impaired histone H2A ubiquitylation at damaged sites (Fig. 4c and Supplementary Fig. 9b). Consistent with PIAS4 being required for DNA-damage-induced accrual of 53BP1, BRCA1, FK2-ubiquitin conjugates and ubiquitin-H2A, the recruitment of endogenous RNF168 to damaged regions was impaired in PIAS4-depleted cells (Fig. 4d and Supplementary Figs 9c and 9d). By contrast, RNF168 still assembled at damage sites in PIAS1-depleted cells (Fig. 4d; as shown previously, RNF168 accrual was RNF8 dependent). Because 53BP1 still accumulated under conditions where the FK2-ubiquitin signal was severely impaired (upon BRCA1 or PIAS1 depletion; Figs 1d, 1e, 2a, 2b and 4a), these data implied that 53BP1 recruitment does not require ubiquitin conjugates recognized by the FK2 antibody but, instead, relies on other ubiquitylated proteins (most likely H2A and H2AX). Significantly, depletion of RNF8 or RNF168, although abolishing 53BP1 accrual at sites of DNA damage, did not affect accumulation of GFP-PIAS1 or RFP-PIAS4 (Figs 4e and 4f). We therefore conclude that PIAS1 and PIAS4 function in parallel with RNF8 to orchestrate RNF8-, RNF168- and BRCA1-dependent accumulation of ubiquitin conjugates at DNA-damage sites. Only PIAS4, however, is needed for RNF8- and RNF168-mediated H2A and possibly H2AX ubiquitylation.

Figure 4 | Linkage between PIAS1/4 and RNF8/168. a, U2OS cells were treated and probed as indicated. b, c, As (a) except cells stably expressed GFP-RNF8 or GFP-BRCA1/Flag-BARD1; see Supplementary Figs 9a and b for quantifications. d, U2OS cells were treated and probed as indicated; see Supplementary Figs 9d and 10c for quantifications and siRNA efficiencies, respectively. e, f, U2OS cells stably expressing GFP-PIAS1 (e) or RFP-PIAS4 (f) were treated and probed as indicated. g, Model; dashed arrows indicate protein requirements for accumulation, solid arrows indicate target-protein modifications. Ub, ubiquitin. S, SUMO.

Our findings invoke a model in which PIAS1 and PIAS4 act in parallel but overlapping SUMO-conjugation pathways to control the DDR (Fig. 4g). In this regard, we note that mouse knockout studies have revealed that PIAS1 or PIAS4 loss is tolerated, whereas deletion of both leads to embryonic lethality and an inability to derive viable cells. Significantly, whereas both PIAS1 and PIAS4 promote FK2-ubiquitin-adduct accumulation, only PIAS4 is needed for accrual of RNF168 and ubiquitylated H2A at DNA-damage sites. An attractive explanation for these and other data is that, after being recruited by RNF8-, PIAS1- and PIAS4-dependent mechanisms, BRCA1 (together with BARD1) is itself the major ubiquitin E3 ligase for generating FK2-reactive ubiquitin conjugates. Significantly, after PIAS1 or PIAS4 depletion, we still detect weak association of BRCA1 at damage sites but not the accumulation of BRCA1-dependent FK2-ubiquitin conjugates. Consequently, we speculate that PIAS1- and PIAS4-dependent SUMOylation of BRCA1 (and in all likelihood various other DDR proteins) not only mediates the stable association of BRCA1 with DNA-damage sites but also promotes BRCA1 ubiquitin-ligase activity. Furthermore, we found that GFP-RNF8 recruitment still occurred upon PIAS1 or PIAS4 depletion, showing that RNF8 recruitment is insufficient to effectively recruit RNF168 and mediate effective ubiquitin-conjugate production at DSB sites. Thus, we speculate that RNF8, RNF168 and/or BRCA1/BARD1 might require pre-SUMOylation of their targets and/or that SUMOylation regulates their ubiquitin-ligase activities. Future studies will be required to define the precise mechanisms by which the ubiquitin- and SUMO-conjugation systems cooperate at DSB sites, and determine how PIAS1 and PIAS4 impinge on chromatin structure, promote DSB signalling and repair, and potentially regulate yet other aspects of the DDR.


METHODS SUMMARY 


U2OS-based cell lines were maintained under standard conditions. cDNA cloning was by standard procedures. siRNA transfections were with Lipofectamine RNAiMAX (Invitrogen). Ionizing radiation was administered with a Faxitron X-ray machine (Faxitron X-ray Corporation). ATM inhibition was by KU-55933 (KuDOS Pharmaceuticals). Laser micro-irradiation was with a FluoView 1000 confocal microscope (Olympus) with 37 °C heating stage (Ibidi) and 405 nm diode (6 mW). FRAP was performed when laser-track accumulation of GFP-tagged protein reached maximal steady-state level. For immunofluorescence, cells were pre-extracted or not, fixed with 2% paraformaldehyde, permeabilized and stained. For whole cell extracts, cells were lysed on plates with 2% SDS, 50 mM Tris-HCl pH 7.5, 20 mM N-ethylmaleimide (Sigma-Aldrich) and protease inhibitor cocktail (Roche). To immunoprecipitate 53BP1, BRCA1 and SUMOylated proteins, different lysis and binding buffers were used (Supplementary Information). Homologous recombination and NHEJ assays were performed as described previously. For ionizing radiation survival, cells were transfected with siRNA and exposed to ionizing radiation. After 10-14 days, colonies were stained with 0.5% crystal violet/20% ethanol, counted and normalized to plating efficiencies. For fluorescence-activated cell sorting of propidium iodide-stained cells, data were analysed by FlowJo software. All error bars represent s.d.
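The normalization of colony counts to plating efficiency mentioned above can be sketched as follows. This is a minimal illustration that assumes the conventional clonogenic-survival definitions; the function names and the example counts are hypothetical and are not taken from the paper.

```python
def plating_efficiency(colonies: int, cells_seeded: int) -> float:
    """Fraction of seeded, unirradiated cells that form a colony."""
    return colonies / cells_seeded


def surviving_fraction(colonies: int, cells_seeded: int, pe_untreated: float) -> float:
    """Colonies after irradiation, normalized to the plating efficiency
    of the matching unirradiated (same siRNA) sample. Assumed formula."""
    return colonies / (cells_seeded * pe_untreated)


# Hypothetical counts for one siRNA condition
pe = plating_efficiency(colonies=120, cells_seeded=200)
sf = surviving_fraction(colonies=30, cells_seeded=500, pe_untreated=pe)
```

Normalizing each siRNA condition to its own unirradiated plate separates any effect of the depletion on colony formation from its effect on radiation sensitivity.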


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 

METHODS

Cell culture. U2OS cells were grown in DMEM (Sigma-Aldrich) supplemented with 10% fetal bovine serum (Biosera), 100 units/ml of penicillin, and 100 µg ml⁻¹ of streptomycin (Sigma-Aldrich). U2OS cells stably expressing GFP-MDC1 (ref. 1), GFP-53BP1, GFP-CtIP (ref. 2), GFP-RNF8, GFP-SUMO1, GFP-SUMO2, GFP-SUMO3, GFP-UBC9, GFP-PIAS1 (wild type, LD, ΔSAP and LDΔSAP) and RFP-PIAS4 (wild type, wild type siRNA resistant, LD, ΔSAP and LDΔSAP) were grown in standard U2OS medium supplemented with 1 mg ml⁻¹ of G418 (Gibco, Invitrogen). U2OS cells stably expressing GFP-BRCA1 and Flag-BARD1 (ref. 3) were provided by C. Lukas and were grown with 0.4 mg ml⁻¹ of G418. hTERT RIDDLE syndrome fibroblasts complemented with vector or HA-RIDDLIN/RNF168 (ref. 4) were provided by D. Durocher and G. Stewart, and were grown in standard medium supplemented with 0.5 mg ml⁻¹ of G418.

siRNA transfection and sequences. siRNA duplexes were obtained from MWG 
biotech or QIAGEN (Supplementary Table 1). Two consecutive rounds of siRNA 
transfections were carried out with Lipofectamine RNAiMAX (Invitrogen) 
according to the manufacturer’s protocol unless otherwise specified. siRNA- 
transfected cells were assayed 48h after transfection. For co-transfection with 
siRNA and expression constructs, cells were first transfected with siRNA followed 
by plasmid transfection 24h later by using Fugene6 (Roche) according to manu- 
facturer’s protocol. Cells were assayed 48h after plasmid transfections. siRNA- 
mediated downregulation of overexpressed protein was achieved by a first round 
of siRNA transfection as described above with an additional siRNA transfection 
24h after plasmid transfection. All PIAS4 siRNA mediated down regulation 
experiments were carried out using PIAS4-1 siRNA unless otherwise specified. 
Laser micro-irradiation and imaging of live and fixed cells. For generation of 
localized damage in cellular DNA by exposure to an ultraviolet-A laser beam,
cells were plated on glass-bottomed dishes (Willco-Wells) and pre-sensitized
with 10 µM 5-bromo-2’-deoxyuridine (BrdU, Sigma-Aldrich) in phenol red-free
medium (Invitrogen) for 24 h at 37 °C. Laser micro-irradiation was performed
by using a FluoView 1000 confocal microscope (Olympus) equipped with a
37 °C heating stage (Ibidi) and a 405 nm laser diode (6 mW) focused through
a ×60 UPlanSApo/1.35 oil objective to yield a spot size of 0.5-1 µm. The time of
cell exposure to the laser beam was around 250 ms (fast scanning mode). Laser 
settings (0.40 mW output, 50 scans) were chosen that generate a detectable 
damage response restricted to the laser path in a pre-sensitization-dependent 
manner without noticeable cytotoxicity. Imaging of live and fixed cells was done 
on the same microscope by using the objective lens and software described 
above. 

Fluorescence recovery after photobleaching (FRAP). FRAP analyses were performed on the microscope used for laser micro-irradiation when the accumulation of the GFP-tagged protein on the laser track reached its maximal steady-state level. After a series of three pre-bleach images, a rectangular region placed over the laser-damaged line was subjected to a bleach pulse (five scans with 488 nm argon laser focused through a ×60 UPlanSApo/1.35 oil objective, main scanner, 100% AOTF (acousto-optical tunable filter), slow scanning mode), followed by image acquisition in 5 s intervals for GFP-53BP1 and at fastest speed for GFP-BRCA1. Average fluorescence intensities in the bleached region were normalized against intensities in an undamaged nucleus in the same field after background subtraction to correct for overall bleaching of the GFP signal due to repetitive imaging. For mathematical modelling of GFP-tagged protein mobility, $(I_t - I_0)/I_{\mathrm{pre}}$ values were plotted as a function of time, where $I_t$ is the fluorescence intensity at time $t$, $I_0$ is the fluorescence intensity immediately after bleaching and $I_{\mathrm{pre}}$ is the average of the three pre-bleach measurements. Estimation of mobile protein fraction ($A$) and residence time ($\tau$) were performed using Prism 4 software assuming the existence of one protein population using the following equation: $y(t) = A(1 - \exp(-t/\tau))$.
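The normalization and single-population fit described above can be sketched in plain Python. This is an illustrative stand-in for the Prism 4 analysis, not the authors' procedure: the function names are hypothetical, the bleaching correction is assumed to be a ratio of background-subtracted intensities, and the fit uses a golden-section search over $\tau$ with the least-squares $A$ computed in closed form for each candidate $\tau$.

```python
import math


def normalize_frap(bleached, reference, background, n_pre=3):
    """Return (I_t - I_0) / I_pre for the post-bleach frames.

    bleached, reference: mean intensities per frame in the bleached region
    and in an undamaged nucleus; background: scalar background intensity.
    The ratio-based bleaching correction is an assumption.
    """
    corrected = [(b - background) / (r - background)
                 for b, r in zip(bleached, reference)]
    i_pre = sum(corrected[:n_pre]) / n_pre   # average of pre-bleach frames
    i_0 = corrected[n_pre]                   # first frame after the bleach
    return [(c - i_0) / i_pre for c in corrected[n_pre:]]


def fit_single_exponential(times, values, tau_lo=0.01, tau_hi=1000.0, iters=200):
    """Least-squares fit of y(t) = A * (1 - exp(-t / tau)); returns (A, tau)."""
    def best_a_and_sse(tau):
        f = [1.0 - math.exp(-t / tau) for t in times]
        a = sum(y * x for y, x in zip(values, f)) / sum(x * x for x in f)
        sse = sum((y - a * x) ** 2 for y, x in zip(values, f))
        return a, sse

    # Golden-section search on log(tau); assumes a single minimum in range.
    lo, hi = math.log(tau_lo), math.log(tau_hi)
    phi = (math.sqrt(5.0) - 1.0) / 2.0
    for _ in range(iters):
        m1 = hi - phi * (hi - lo)
        m2 = lo + phi * (hi - lo)
        if best_a_and_sse(math.exp(m1))[1] < best_a_and_sse(math.exp(m2))[1]:
            hi = m2
        else:
            lo = m1
    tau = math.exp((lo + hi) / 2.0)
    return best_a_and_sse(tau)[0], tau
```

Here `times` is measured from the first post-bleach frame, in the same units as the returned $\tau$. A larger fitted $A$ corresponds to a larger mobile fraction and a smaller $\tau$ to a shorter residence time, which is how the effects of PIAS1 or PIAS4 depletion are reported in the main text.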
Immunofluorescence. Cells were washed three times with PBS 0.1% Tween-20 followed by pre-extraction for 10 min (pre-extraction buffer: 25 mM HEPES 7.4, 50 mM NaCl, 1 mM EDTA, 3 mM MgCl2, 300 mM sucrose and 0.5% Triton X-100). Cells were fixed with 2% formaldehyde (w/v) in PBS for 20 min. Following three washes with PBS 0.1% Tween-20, cells were blocked for 1 h with 5% BSA in PBS 0.1% Tween-20, co-stained with the appropriate antibodies (Supplementary Table 3) in blocking solution overnight and then co-immunostained with the appropriate secondary antibodies (Supplementary Table 3) in blocking solution. For imaging RNF168/RIDDLIN, the pre-extraction buffer was: 20 mM HEPES, 20 mM NaCl, 5 mM MgCl2, 1 mM DTT, 0.5% NP-40, serine/threonine phosphatase inhibitor cocktail (Sigma-Aldrich), protease inhibitor cocktail (Roche). The pre-extraction step was omitted and permeabilization (0.5% Triton X-100 in PBS) was performed after fixation for imaging of UBC9, PIAS1, PIAS4, CtIP and RNF8.

Treatment with small-molecule inhibitors and DNA-damaging agents. 
Camptothecin was obtained from Sigma-Aldrich; the ATM inhibitor KU-55933 was provided by KuDOS Pharmaceuticals. Ionizing radiation treatment was performed by using a Faxitron X-ray machine (Faxitron X-ray Corporation). Where appropriate, ATM inhibitor (20 µM) was applied to the culture medium 1 h before laser micro-irradiation.

Plasmids and cloning. SUMO1, SUMO2, SUMO3, UBC9, PIAS1, PIAS2, PIAS3 and PIAS4 were PCR amplified from a human fetal brain cDNA library and cloned into pCS2-mRFP (R. Y. Tsien) and pEGFP-C1 (Clontech). PIAS1ΔSAP and PIAS4ΔSAP were sub-cloned from the original clones whereas PIAS1C351A and PIAS4C342A/C347A were created using a QuikChange Site-Directed Mutagenesis kit (Stratagene). The PIAS4 siRNA-resistant clone was obtained by inserting the 7-nucleotide mismatches underlined (GATCCAAAGTCCGGACTGAA) into PIAS4 cDNA using a QuikChange Site-Directed Mutagenesis kit (Stratagene). MMS21 was cloned into pCL-NCX (Imgenex) initially modified to contain a 7His-3XFlag tag using an adaptor duplex. RNF8 was cloned into pCDNA3.1 (Invitrogen) initially modified to contain GFP. PIAS1 and PIAS4 were also cloned into pCDNA3.1(-) (Invitrogen) initially modified to contain 3XFlag-S-tag in the same reading frame as pEGFP-C1. Mammalian expression plasmids encoding HA-53BP1 (full length, N, C, CΔBRCT and BRCT) were provided by K. Iwabuchi. Mammalian expression plasmids encoding Flag-BARD1 and HA-BRCA1 were provided by R. Baer and P. Harkin respectively. Primers were obtained from Sigma-Aldrich (Supplementary Table 2).

Immunoprecipitation and immunoblotting. Cell extracts were prepared on plates by using lysis buffer containing 2% SDS, 50 mM Tris-HCl pH 7.5, 20 mM N-ethylmaleimide (Sigma-Aldrich) and protease inhibitor cocktail (Roche). Viscosity was reduced by sonicating the extracts or passing them 10 times through a syringe fitted with a 19G needle. For 53BP1 immunoprecipitation, cell extracts were prepared as mentioned above and then diluted 1:20 with lysis buffer containing 150 mM NaCl and 1% NP40 instead of SDS. For BRCA1 immunoprecipitation, cell extracts were prepared by using lysis buffer containing 20 mM HEPES pH 7.4, 450 mM NaCl, 1.5 mM MgCl2, 1 mM EGTA, 1% Tween20, 10% glycerol, serine/threonine phosphatase inhibitor cocktail (Sigma-Aldrich), protease inhibitor cocktail (Roche) and 10 mM N-ethylmaleimide (Sigma-Aldrich); the extracts were then sonicated and diluted 1:2 with the same buffer lacking NaCl. For GFP and HA-53BP1 immunoprecipitation using GFP-Trap-A (ChromoTek) and anti-haemagglutinin antibodies, cell extracts were prepared by using lysis buffer containing 20 mM HEPES pH 7.4, 500 mM NaCl, 1.5 mM MgCl2, 1 mM EGTA, 1% Triton X-100, 10% glycerol, serine/threonine phosphatase inhibitor cocktail (Sigma-Aldrich), protease inhibitor cocktail (Roche) and 10 mM N-ethylmaleimide (Sigma-Aldrich); the extracts were then sonicated and diluted 1:2 with the same buffer lacking NaCl. In all cases the extracts were cleared using centrifugation at 16,000g for 45 min at 4 °C. Antibodies against 53BP1 (Sigma-Aldrich) and BRCA1 (Santa-Cruz 1:1 mix of rabbit polyclonals, same mix was used for immunoblotting) were pre-bound to Dynabeads ProteinG (Invitrogen) and incubated for 2 h at room temperature (53BP1) or overnight at 4 °C (BRCA1, GFP-Trap-A and HA) followed by five washes with immunoprecipitation buffer (two washes with lysis buffer were added for the GFP-Trap-A immunoprecipitation) and 5 min boiling in 1.5× SDS sample buffer. Proteins were resolved by 4-18% gradient SDS-PAGE (unless otherwise specified) and transferred to PVDF membrane (GE Healthcare). Immunoblotting was performed with the appropriate antibodies (Supplementary Table 3).

Random plasmid integration assay. Assays were performed as previously
described with minor modifications. Briefly, one day after transfection with
siRNA, U2OS cells were transfected with BamHI-XhoI linearized pEGFP-C1
(Clontech). The following day, cells were collected, counted and plated on three
plates, one of which contained 0.5 mg ml⁻¹ G418. One day after plating, the cells
on one of the plates lacking G418 were fixed to assess transfection efficiency
and the other two plates were incubated for 10-14 days at 37 °C to allow colony
formation. Colonies were stained with 0.5% crystal violet/20% ethanol and counted.
Random-plasmid integration events were normalized to transfection and plating
efficiencies.
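
The normalization in this assay can be illustrated with a short calculation. The sketch below is only one plausible reading: it assumes that plating efficiency is the fraction of plated cells that form colonies without G418, and that transfection efficiency is the GFP-positive fraction scored on the fixed plate. The function name, argument names and the formula itself are hypothetical, because the text does not state the exact expression used.

```python
def integration_frequency(colonies_g418, cells_plated_g418,
                          colonies_no_g418, cells_plated_no_g418,
                          transfection_efficiency):
    """Integration events per viable, transfected cell (illustrative formula)."""
    if cells_plated_g418 <= 0 or cells_plated_no_g418 <= 0:
        raise ValueError("plated cell counts must be positive")
    if not 0 < transfection_efficiency <= 1:
        raise ValueError("transfection efficiency must be in (0, 1]")

    # Assumed definition: colonies formed per cell plated without selection.
    plating_efficiency = colonies_no_g418 / cells_plated_no_g418
    if plating_efficiency <= 0:
        raise ValueError("no colonies on the unselected plate")

    # Cells on the G418 plate that were both viable and transfected.
    viable_transfected = (cells_plated_g418
                          * plating_efficiency
                          * transfection_efficiency)
    return colonies_g418 / viable_transfected


if __name__ == "__main__":
    # Made-up counts, for illustration only.
    freq = integration_frequency(20, 100_000, 50, 100, 0.4)
    print(f"integration frequency: {freq:.2e}")
```

Dividing by both efficiencies keeps a knockdown that merely lowers viability or transfection from being scored as an integration defect.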

Homologous recombination assay. A U2OS clone with the integrated homologous
recombination reporter DR-GFP was generated as described previously.
One day after transfection with siRNA, U2OS-DR-GFP cells were
co-transfected with an I-SceI expression vector (pCBA-I-SceI) and a vector
expressing monomeric red fluorescent protein (pCS2-mRFP). The latter plasmid
was added in a 1:5 ratio to mark the I-SceI-positive cells. Cells were harvested one
day after I-SceI transfection and subjected to flow cytometric analysis to examine
recombination induced by I-SceI digestion. Only RFP-positive cells were analysed
for homologous recombination efficiency to circumvent possible differences in
transfection efficiencies. Fluorescence-activated cell sorting data were analysed by
using Summit V4.3 software to reveal the percentage of GFP-positive cells relative
to the number of transfected cells (RFP positive). The data were related to a
control siRNA treatment in each individual experiment. The dividing line
between GFP (homologous recombination) positive and negative cells was set to a
0.5% background level of GFP-positive cells in the internal control (RFP positive,
not transfected with I-SceI). This gate was then applied to the RFP/I-SceI positive
samples to determine homologous recombination efficiency. Results are
presented as a percentage of control siRNA.
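
The readout described above is a ratio of ratios: GFP-positive cells as a fraction of RFP-positive cells, expressed relative to the control siRNA sample from the same experiment. A minimal sketch follows, assuming gated event counts are already available; the function and argument names are hypothetical and the counts are made up.

```python
def hr_percent_of_control(gfp_pos_sample, rfp_pos_sample,
                          gfp_pos_control, rfp_pos_control):
    """HR efficiency of a sample as a percentage of the control siRNA value."""
    if rfp_pos_sample <= 0 or rfp_pos_control <= 0:
        raise ValueError("RFP-positive counts must be positive")

    # Fraction of transfected (RFP-positive) cells that are GFP-positive.
    sample_fraction = gfp_pos_sample / rfp_pos_sample
    control_fraction = gfp_pos_control / rfp_pos_control
    if control_fraction <= 0:
        raise ValueError("control sample has no GFP-positive cells")

    return 100.0 * sample_fraction / control_fraction


if __name__ == "__main__":
    # Made-up counts: 50 of 1,000 transfected cells versus 100 of 1,000.
    print(hr_percent_of_control(50, 1000, 100, 1000))
```

Restricting the denominator to RFP-positive events is what removes differences in transfection efficiency between samples.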

Ionizing radiation survival assays. U2OS cells were transfected with siRNA and
exposed to ionizing radiation. Cells were left for 10-14 days at 37 °C to allow
colony formation. Colonies were stained with 0.5% crystal violet/20% ethanol
and counted. Results were normalized to plating efficiencies.

Fluorescence-activated cell sorting (FACS). To determine cell-cycle distribution,
cells were fixed with 70% ethanol, incubated for 30 min with RNase A
(250 µg ml⁻¹) and propidium iodide (10 µg ml⁻¹) at 37 °C and analysed by
FACS. Data were analysed by using FlowJo software to reveal the percentage
of cells in each phase of the cell cycle.

Coordinating DNA replication by means of priming
loop and differential synthesis rate


Manjula Pandey, Salman Syed, Ilker Donmez, Gayatri Patel, Taekjip Ha & Smita S. Patel


Genomic DNA is replicated by two DNA polymerase molecules, one
of which works in close association with the helicase to copy the
leading-strand template in a continuous manner while the
second copies the already unwound lagging-strand template in a
discontinuous manner through the synthesis of Okazaki fragments.
Considering that the lagging-strand polymerase has to
recycle after the completion of every Okazaki fragment through
the slow steps of primer synthesis and hand-off to the polymerase,
it is not understood how the two strands are synthesized with the
same net rate. Here we show, using the T7 replication proteins,
that RNA primers are made ‘on the fly’ during ongoing DNA synthesis
and that the leading-strand T7 replisome does not pause
during primer synthesis, contrary to previous reports. Instead,
the leading-strand polymerase remains limited by the speed of the
helicase; it therefore synthesizes DNA more slowly than the
lagging-strand polymerase. We show that the primase-helicase
T7 gp4 maintains contact with the priming sequence during
ongoing DNA synthesis; the nascent lagging-strand template
therefore organizes into a priming loop that keeps the primer in
physical proximity to the replication complex. Our findings provide
three synergistic mechanisms of coordination: first, primers are
made concomitantly with DNA synthesis; second, the priming loop
ensures efficient primer use and hand-off to the polymerase; and
third, the lagging-strand polymerase copies DNA faster, which
allows it to keep up with leading-strand DNA synthesis overall.

To investigate the functional cooperativity between the enzymatic
activities of the T7 replication complex, we measured the kinetics of
DNA unwinding, DNA synthesis and primer synthesis on synthetic
replication-fork substrates with and without the T7 priming
sequence (3’-CTGGG-5’; Supplementary Table 1). Efficient synthesis
of RNA primers from dimer to pentamer by T7 replisome
(T7 gp4 and T7 DNA polymerase) was observed on the priming fork
(Fig. 1a and Supplementary Fig. 1) with a half-life of about 0.5 s and a
yield of more than 60% (Fig. 1a, right). T7 gp4 alone also makes RNA
primers on this priming fork, but roughly tenfold more slowly
(Supplementary Fig. 1), which is consistent with polymerase
assistance of the helicase rate. An average 46% yield of primer synthesis
with forks of different lengths and sequences (Supplementary Table 2)
indicates that T7 replisome lays down primers on newly unwound
lagging-strand template with a high efficiency. In addition, the newly
made primers are elongated through lagging-strand DNA synthesis
(Supplementary Fig. 1).

All-or-none DNA strand separation assays under primer synthesis
conditions show that T7 replisome unwinds the priming fork
and the control fork (without the priming sequence) with similar rate
constants at all dTTP concentrations (Supplementary Fig. 2).
Single-molecule fluorescence resonance energy transfer (FRET) unwinding
assays show an increase in fluorescence intensity of Cy3 (donor,
green) and a decrease in Cy5 (acceptor, red) fluorescence intensity as
a result of DNA unwinding and synthesis (Fig. 1b), and the priming
and control forks show a similar FRET decrease time (Δt) of
0.4 ± 0.027 and 0.37 ± 0.022 s, respectively. By comparing the histograms
of FRET values visited during the unwinding reactions
between the T7 replisome reaction and the roughly threefold slower
reaction by T7 gp4 alone (Supplementary Fig. 3), we further confirmed
that the T7 replisome does not pause during primer synthesis
(Supplementary Fig. 6). If the T7 replisome paused for several seconds
every time a primer is made, our single-molecule analysis would
have detected the pausing events.

To investigate whether DNA synthesis was occurring concomitantly
with primer synthesis, the kinetics of strand-displacement
DNA synthesis was measured on the priming and control forks under
the same reaction conditions as in Fig. 1a. In the high-resolution
sequencing gels used to analyse the DNA synthesis kinetics, any
pausing of the T7 replisome activity as a result of primer synthesis
would be detected as an accumulation of specific DNA products in
the priming-fork reactions, but not in the control. However, no
unusual accumulation of intermediate DNA products as a result of
replisome pausing was observed with the priming-fork template in
comparison with the control (Fig. 1c). Consistent with this result, the
average DNA elongation rates on priming (126 ± 9 nucleotides s⁻¹)
and control (113 ± 8 nucleotides s⁻¹) forks were similar (Fig. 1c). No
pausing was detected on longer forks (Supplementary Figs 1 and 8) or
on forks with different GC contents or at different dNTP concentrations
(Supplementary Table 3). Finally, coupled leading-strand and
lagging-strand DNA synthesis measured by the rolling-circle assay
with T7 gp2.5 showed no effect of lagging-strand synthesis on the rate
of leading-strand synthesis (Supplementary Fig. 9). Similar observations
have been made with the T4 replication proteins, although a
recent study of Escherichia coli replication yielded a different result.
Overall, our results indicate that DNA synthesis continues uninterrupted
while RNA primers are laid down, and the leading-strand
polymerase does not slow as a function of primase activity or as a
result of any of the steps during lagging-strand polymerase recycling.

Because DNA synthesis continues uninterrupted while primers are
being synthesized, our results predict that the nascent lagging-strand
template should loop out between the covalently linked helicase and
primase domains of T7 gp4 (Fig. 2A). The formation of such a priming
loop during DNA synthesis was probed with single-molecule
FRET experiments: Cy3 and Cy5 fluorophores were introduced 40
base pairs (bp) apart on the lagging-strand template of the
surface-attached DNA fork (Fig. 2B). Before the unwinding of DNA, no FRET
was observed because of the long (40-bp) distance between the
fluorophores (Fig. 2B, C, a). As T7 replisome unwinds the double-stranded
DNA (dsDNA), the donor Cy3 shows an increase in intensity (green
trace in Fig. 2C, top panel) due to a change in environment from
protein-induced fluorescence enhancement and DNA strand
separation (Fig. 2B, b). Continued DNA unwinding brings the priming
sequence and the nearby donor close to the primase domain, where
they are held in place. The replisome continues unwinding the DNA
while the primase domain is engaged with the priming sequence;
therefore, at some point in time, the acceptor comes close to the donor
(Fig. 2B, c) and this was detected as an increase in FRET (Fig. 2C, c, top
and middle panels), providing evidence for the formation of a priming
loop. In all, 40 molecules out of about 75 with a fluorescently active
donor and acceptor showed formation of a priming loop. With continued
unwinding, the priming loop grows in size and the donor and
acceptor move apart (Fig. 2B, d), which was detected as a decrease in
FRET (Fig. 2C, d, top and middle panels). Finally, the total fluorescence
signal disappears on completion of the reaction and the release
of the fluorescently labelled DNA strand from the surface.

The control fork showed an increase in donor intensity (green)
(Fig. 2C, bottom panel) but no increase in acceptor intensity or
FRET (more than 200 molecules were analysed). The donor intensity
time (Fig. 2D, bottom panel), defined as the time between the jump in donor
intensity and the total signal disappearance, was the same for priming
and control DNAs, indicating that both DNAs are unwound at the
same rate. The average time for FRET increase (Fig. 2D, top panel) with
the priming fork was threefold longer for the experiments with T7 gp4
only (Supplementary Fig. 11) compared with T7 replisome, indicating
the assistance of the polymerase in the reactions. To test whether
compaction of unwound single-stranded DNA (ssDNA) might give
rise to high FRET values, we measured FRET between Cy3 and Cy5
separated by 40 nucleotides of ssDNA and found the average FRET
value to be 0.2, much lower than those obtained during priming-loop
formation (Supplementary Fig. 12). Overall, these results show, first,
that the primase remains engaged with the priming sequence while
DNA continues to be synthesized, and second, that the nascent lagging
strand forms a priming loop.

Making primers ahead of time during ongoing DNA synthesis 
minimizes the delay due to primer synthesis, and keeping the RNA 
primers in physical proximity to the replicating complex provides a 
mechanism for efficient use of the priming sequence and hand-off to 
the polymerase. Nevertheless, lagging-strand polymerase dissoci- 
ation, primer hand-off and initiation of a new Okazaki fragment 
synthesis event take time that could delay the synthesis of the lagging 
strand. Because the leading-strand replisome does not slow or pause 
during primer synthesis, the question remains as to how the lagging- 
strand polymerase keeps up with the leading-strand polymerase after 
many Okazaki fragment synthesis events. We therefore tested an 
alternative model, proposed more than 20 years ago, that the
leading-strand polymerase simply moves with a slower overall rate than
the lagging-strand polymerase.

Figure 2 | Priming loop: the primase domain maintains contact with the
priming sequence during replication. A, Diagram showing that while the
primase domain remains bound to the priming sequence to make RNA (red),
nascent lagging-strand template forms a priming loop. Red A, acceptor; green
D, donor. B, Fluorescently labelled DNA fork to investigate the priming loop.
C, Top: Cy3 (donor, green) and Cy5 (acceptor, red) intensity time traces
during DNA synthesis by T7 replisome. Labels a-d correspond to the DNA
states in the diagrams in B. Middle: plot of FRET efficiency against time.
Bottom: representative Cy3 and Cy5 intensity traces on the control fork
without the priming sequence. D, Plots of cumulative fraction against the
indicated time intervals determined from single-molecule time traces. The
time intervals measured are marked by the arrows in C. In all, 40 molecules
from one experiment were used to build the curves (see Methods).
Supplementary Fig. 10 shows additional representative time traces.

Figure 3 | Lagging-strand synthesis is faster than leading-strand synthesis.
a, Sequencing gels showing the progressive elongation of fluorescein-labelled
DNA primer by T7 replisome (200 nM) on 100 nM of fork DNA substrate ts40
(left) and primer-template DNA substrate p/t40 (right) with T7 gp2.5 (5 µM)
under the same conditions as in Fig. 1a, without dT90 trap. b, The rate
constants of individual nucleotide (nt) addition steps. Nucleotide
incorporation rates were calculated from the global fits to the polymerization
model; the average DNA primer elongation rates were 132 ± 10 (left) and
188 ± 10 nucleotides s⁻¹ (means ± s.e.m.). Error bars indicate s.e.m.

Figure 4 | Model of T7 DNA replication. The leading-strand template
(yellow) is copied continuously by the cooperative action of T7 DNA
polymerase and T7 gp4 while the lagging-strand template (in blue, coated
with gp2.5) is copied by T7 DNA polymerase through the synthesis of
Okazaki fragments. The physical coupling of the leading-strand and
lagging-strand polymerases by interactions between T7 gp4 and gp2.5 creates a
trombone loop. The priming loop (coated with gp2.5) is created between the
physically linked primase and helicase domains of T7 gp4 as a result of
ongoing DNA synthesis during primer synthesis. The priming loop keeps the
nascent primer within physical reach of the lagging-strand polymerase. On
primer hand-off to the lagging-strand polymerase, the priming loop
becomes part of the trombone loop.

The transient-state kinetic assays allow us to measure the DNA
synthesis rates very precisely. To measure the rate of DNA synthesis
as catalysed by the lagging-strand polymerase, we used a
primer-template DNA substrate coated with T7 gp2.5 that mimics the already
unwound lagging-strand template. To measure the rate of
leading-strand synthesis by the T7 replisome, we used a fork substrate that
contained the same template sequence in the dsDNA. T7 DNA polymerase
copied the gp2.5-coated ssDNA template 30% faster (188 ± 10
nucleotides s⁻¹) than the T7 replisome did (132 ± 10 nucleotides s⁻¹)
(Fig. 3). A faster rate of DNA synthesis by T7 DNA polymerase than
that of the replisome was also observed with E. coli single-strand binding
protein-coated template (158 ± 10 nucleotides s⁻¹) and without any
single-strand binding protein (200 ± 8 nucleotides s⁻¹) (Supplementary
Fig. 13 and Supplementary Table 3). That the T7 replisome moves
more slowly than the DNA polymerase alone is consistent with the
notion that the DNA synthesis rate is limited by the speed of the
helicase. From multiple experiments we estimate that T7 DNA polymerase
alone copies the ssDNA template on average 38% faster than
the T7 replisome does (Supplementary Table 3). From the 30% difference
in rates, we calculate that the leading-strand polymerase will
take on average 6-7 s longer than the lagging-strand polymerase to
copy 3,000 bp of DNA, the average length of an Okazaki fragment.
Unless physical coupling slows its rate, the lagging-strand polymerase
will reach the end of the previously made Okazaki fragment with 6-7 s
to spare to pick up a new primer and initiate another round of Okazaki
fragment synthesis.
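
The 6-7 s figure can be checked with simple arithmetic. The sketch below assumes that the two rates reported for Fig. 3 (132 and 188 nucleotides per second) are the ones behind the estimate, and that each polymerase moves at a constant average rate over a 3,000-nucleotide fragment; the variable and function names are illustrative only.

```python
def copy_time_s(length_nt, rate_nt_per_s):
    """Time in seconds to copy length_nt nucleotides at a constant rate."""
    if rate_nt_per_s <= 0:
        raise ValueError("rate must be positive")
    return length_nt / rate_nt_per_s


OKAZAKI_LENGTH_NT = 3000    # average Okazaki fragment length quoted in the text
LEADING_RATE_NT_S = 132.0   # T7 replisome on the fork substrate (Fig. 3)
LAGGING_RATE_NT_S = 188.0   # T7 DNA polymerase on gp2.5-coated ssDNA (Fig. 3)

leading_time_s = copy_time_s(OKAZAKI_LENGTH_NT, LEADING_RATE_NT_S)
lagging_time_s = copy_time_s(OKAZAKI_LENGTH_NT, LAGGING_RATE_NT_S)

# Time the lagging-strand polymerase has left over for recycling steps.
spare_time_s = leading_time_s - lagging_time_s

if __name__ == "__main__":
    print(f"leading strand: {leading_time_s:.1f} s")
    print(f"lagging strand: {lagging_time_s:.1f} s")
    print(f"spare time:     {spare_time_s:.1f} s")
```

With these inputs the leading strand takes a little under 23 s and the lagging strand about 16 s, so the difference falls within the 6-7 s window stated above.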

Our failure to observe replisome pausing contrasts with previous
reports suggesting that T7 replisome pauses during primer synthesis.
DNA synthesis in those studies was measured indirectly by
following the overall shortening of DNA as dsDNA was converted to
coiled ssDNA. In the presence of ATP plus CTP, intervals of 5-6 s
were observed with no change in DNA length, which was attributed
to replisome stopping. We propose that these pauses are caused not
by replisome stopping but by the conversion of ssDNA back to
dsDNA as a result of uncoupled lagging-strand synthesis. Although
reactions were washed, contaminating polymerases catalysing
uncoupled DNA synthesis, including those tethered by means of
T7 gp4 (ref. 25), could not be ruled out. Under conditions in which
excess polymerase was present, both transient loops and pauses were
observed. The possibility could not be ruled out that the pausing
and looping pattern was caused by separate Okazaki fragment synthesis
events. Those that were coupled showed loop release, and those
that were uncoupled showed the pausing behaviour.

On the basis of our studies, we propose that T7 replisome does not
pause during primer synthesis or any of the steps of lagging-strand
synthesis. Instead, the following synergistic mechanisms exist to
coordinate strand synthesis. First, primers are made ahead of time
during ongoing DNA synthesis; hence, primer synthesis itself does
not delay lagging-strand synthesis. Second, the primer is kept in
physical proximity to the replication complex by means of a priming
loop that ensures efficient primer use and hand-off to the
lagging-strand polymerase (Fig. 4). Third, the lagging-strand polymerase
copies the ssDNA template at a faster rate, providing extra time
for the recycling steps. In addition to moving faster, multiple
lagging-strand polymerases could work at the same time to complete
lagging-strand synthesis in a shorter time. Under certain conditions, the
lagging-strand polymerase may jump to a new primer before completion
of the Okazaki fragment, thereby leaving gaps that can be
filled in later. Because the basic mechanism of dsDNA replication is
conserved from phage to humans, the mechanisms revealed from
studies of the T7 replication proteins are broadly applicable to the
more complex replication complexes of bacteria and eukaryotes.


METHODS SUMMARY 

Ensemble kinetic assays. T7 gp4 (ref. 28) and T7 DNA polymerase (T7 gp5/
E. coli thioredoxin) were preassembled on the DNA with dTTP and EDTA in
replication buffer (50 mM Tris-HCl, pH 7.6, 40 mM NaCl, 10% glycerol) and the
reactions were initiated with the addition of MgCl2 and the rest of the dNTPs,
with or without ATP, CTP, and dT90 (90-nucleotide poly(dT) added to trap free
and dissociated proteins). The kinetics of primer synthesis and DNA synthesis
was measured with a rapid quenched-flow instrument (KinTek Corp.) and
products were resolved on 24% or 25% polyacrylamide/urea sequencing gel.
DNA synthesis kinetics were fitted to the polymerization model
(Supplementary Information).

Single-molecule FRET assays. Single-molecule FRET experiments to measure
unwinding and priming-loop formation were performed on a wide-field
total-internal-reflection fluorescence microscope with 30 ms time resolution and
imaged by means of a charge-coupled-device camera (iXon DV 887-BI; Andor
Technology). The Cy3 and Cy5 fluorophores were internally labelled on the dT
through a C6 amino linker. Gel-based DNA synthesis reactions were performed
to confirm that the fluorophores on the DNA did not affect DNA synthesis
(Supplementary Fig. 7). The priming-loop substrates were prepared by ligating
donor and acceptor labelled DNAs (Supplementary Information). FRET was
calculated as the ratio of the acceptor intensity and the total (acceptor plus
donor) intensity after correcting for cross-talk between the donor and acceptor
channels and subtracting the background. For Figs 1b and 2, the initiation of
FRET change and its saturation were scored by visual inspection of the donor and
acceptor intensities and the time difference between the two points was
designated as Δt. The calculated FRET efficiency from this method was demonstrated
to be robust (Supplementary Fig. 5). The time for photobleaching of the
fluorophores was at least tenfold longer than the unwinding time, and no
unwinding-like signal was observed without the addition of Mg2+.
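
The FRET ratio defined in this summary can be written out as a short calculation. The sketch below assumes a simple linear correction in which a fixed fraction of the background-subtracted donor signal leaks into the acceptor channel; the `leakage` parameter, the background arguments and the function name are hypothetical, because the text does not give the correction factors that were used.

```python
def fret_efficiency(donor, acceptor, donor_bg=0.0, acceptor_bg=0.0, leakage=0.0):
    """Apparent FRET = A / (A + D) after background and cross-talk correction."""
    # Subtract the background from each channel.
    d = donor - donor_bg
    # Remove the assumed donor leakage from the acceptor channel.
    a = acceptor - acceptor_bg - leakage * d

    total = a + d
    if total == 0:
        raise ValueError("total corrected intensity is zero")
    return a / total


if __name__ == "__main__":
    # Made-up intensities for one time point of one trace.
    print(fret_efficiency(110.0, 410.0, donor_bg=10.0, acceptor_bg=10.0))
```

Applied to every frame of a donor and acceptor trace pair, this gives the FRET-against-time curves from which the initiation and saturation points are scored.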


Full Methods and any associated references are available in the online version of 
the paper at www.nature.com/nature. 
METHODS

Proteins and DNA. T7 gp4A’ and gp5 (exo-) proteins were purified as described
previously. Thioredoxin from E. coli was purchased from Sigma-Aldrich.
Protein concentration was calculated by ultraviolet absorption (in 8 M
guanidinium chloride) using extinction coefficients at 280 nm of 0.0836 µM⁻¹ cm⁻¹ for
T7 gp4A’ and 0.13442 µM⁻¹ cm⁻¹ for T7 gp5 (exo-). Oligodeoxynucleotides
(Supplementary Table 1) were purchased from Integrated DNA Technologies or
Sigma-Aldrich, and purified by PAGE before use. Substrates for the stopped-flow
and gel-based DNA-unwinding assays had an amino linker at the 5’ end of their
bottom strands, which was labelled with 5-(and-6)-carboxyfluorescein
succinimidyl ester, using the procedure from Molecular Probes with carbonate buffer.
Proteins were preassembled on the DNA before the start of the reaction: T7 gp4
was added to the fork substrate with dTTP and EDTA in the replication buffer and
incubated on ice for 30 min. For assembling T7 replisome, T7 DNA polymerase
(T7 gp5 and E. coli thioredoxin (1:5) mixed for 5 min at 22 °C in replication buffer
containing freshly made 5 mM dithiothreitol (DTT)) was added to T7 gp4 and
the DNA mixture and incubated at room temperature (23 ± 1 °C) for a further
30 min.

RNA primer synthesis assay. The protein-DNA complex was loaded in one
syringe of the rapid quenched-flow instrument. The second syringe contained
MgCl2, ATP, CTP mixed with a trace amount of [α-32P]CTP, dATP, dCTP,
dGTP, and dT90 trap in replication buffer (50 mM Tris-HCl, pH 7.6, 40 mM
NaCl, 10% glycerol). The reaction was initiated by mixing equal volumes of
the two solutions at 18 °C and quenched after various time intervals with
300 mM EDTA. Primers generated in the reaction were resolved on 25%
polyacrylamide/3 M urea sequencing gel with 1.5× TBE buffer, running the gel only
three-quarters of the sequencing gel length. The gel was imaged on a Typhoon
Phosphorimager, and the products were analysed with ImageQuant 5.0 software.
The yield of RNA primer synthesis was determined from the radiolabelled CTP
incorporation in the dimer to pentamer RNA, taking into account the number of
C bases in the primers.

DNA synthesis kinetics. T7 replisome-fork DNA or T7 DNA
polymerase-primer/template DNA complex was loaded in one syringe of the quenched-flow
instrument. The second syringe contained dATP, dCTP and dGTP, with or
without NTPs (ATP and CTP), MgCl2 and trap (where applicable; see
Supplementary Table 3) in replication buffer. Reactions were initiated by rapidly
mixing equal volumes of the two solutions, and quenched after various intervals
with 300 mM EDTA. The quenched solution was loaded on 22% or 24%
polyacrylamide/7 M urea sequencing gel with 1.5× TBE buffer. The gel was imaged
with a Phosphorimager, and each DNA band was quantified with ImageQuant
software. The time courses of the individual DNA product formations and
decays were fitted to the polymerization model with gfit
(http://gfit.sourceforge.net) to obtain the individual nucleotide addition rate
constants from which the average DNA primer elongation rate was calculated.

Single-molecule FRET experiments and data analysis. Biotin was attached at
the 5’ end of the DNA strand during DNA synthesis. Cy3 N-hydroxysuccinimido
(NHS) ester and Cy5 NHS ester (GE Healthcare) were internally labelled to the
dT of single-stranded DNA strands by means of a C6 amino linker (modified by
Integrated DNA Technologies, Inc.). A quartz microscope slide (Finkenbeiner)
and coverslip were coated with polyethylene glycol (m-PEG-5000; Laysan Bio
Inc.) and biotinylated PEG (biotin-PEG-5000; Laysan Bio Inc.).
Measurements were performed in a flow chamber that was assembled as follows. After the
assembly of the coverslip and quartz slide, a syringe was attached to an outlet
hole on the quartz slide through tubing. All the solution exchanges were
performed by putting the solutions (0.1 ml) in a pipette tip and affixing it in the inlet
hole, followed by pulling the syringe. The solutions were added in the following
order. Neutravidin (0.2 mg ml⁻¹; Pierce) was applied to the surface and washed
away with T50 buffer (10 mM Tris-HCl, pH 8, 50 mM NaCl). Biotinylated DNA
(about 50-100 pM) in T50 buffer was added and washed away with imaging
buffer (10 mM Tris-HCl, pH 8, 50 mM NaCl, 0.1 mg ml⁻¹ glucose oxidase,
0.02 mg ml⁻¹ catalase, 0.8% dextrose, plus Trolox; ref. 31). For replisome
measurements, T7 gp4 (50 nM hexamer) and T7 DNA polymerase (gp5/thioredoxin)
(50 nM) were loaded on the DNA with 2 mM dTTP, 5 mM DTT and 5 mM
EDTA in imaging buffer, and incubated for 10 min. After a few seconds of
imaging, unwinding and polymerase synthesis were initiated by the addition
of the rest of the dNTPs (1 mM each), 1 mM ATP, 1 mM CTP, 5 mM DTT
and 4 mM free Mg2+ in imaging buffer. All measurements were made at room
temperature.

FRET values were calculated as the ratio between the acceptor intensity and the 
total (acceptor plus donor) intensity after correcting for cross-talk between the 
donor and acceptor channels and subtracting the background. For the unwinding 
experiment shown in Fig. 1b, the initiation of FRET decrease and its saturation 
were scored by visual inspection of the donor and acceptor intensities and the 
calculated FRET efficiency, and the time difference between the two points was 
designated the Δt value of each reaction. Once we had identified a sustained FRET
decrease below 0.5, the first time point at which FRET value dropped below the 
average FRET value before unwinding began, typically about 0.8, was designated 
the initiation point. Similarly, once we had identified a saturation in FRET 
decrease, the first time point at which FRET reached the average FRET value in 
the saturation plateau was designated the saturation point. We demonstrate that 
the error in determining the initial time point of FRET decrease by this method is 
negligible (Supplementary Fig. 5). The FRET increase time in the priming-loop 
experiment was plotted as the cumulative fraction against time to indicate the 
fraction of molecules that had completed the FRET increase up to a given time 
point. The donor intensity time was the time between the initial donor signal 
increase and the final disappearance of fluorescence signal, also determined by 
visual inspection. The donor intensity time was also plotted in the format cumu- 
lative fraction versus time. All data were analysed with scripts written in MATLAB 
and plotted in Origin. 
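
The cumulative-fraction format used for the FRET increase time and the donor intensity time can be sketched as follows. This is an illustration in Python rather than the authors' MATLAB scripts, which are not shown in the text; the function name and the example times are made up.

```python
def cumulative_fraction(times):
    """Return (time, fraction) pairs: fraction of molecules done by each time."""
    ordered = sorted(times)
    n = len(ordered)
    if n == 0:
        raise ValueError("at least one time is required")
    # After the i-th shortest time, (i + 1) of n molecules have finished.
    return [(t, (i + 1) / n) for i, t in enumerate(ordered)]


if __name__ == "__main__":
    # Made-up per-molecule times in seconds.
    for t, frac in cumulative_fraction([0.42, 0.31, 0.55, 0.38]):
        print(f"{t:.2f} s  {frac:.2f}")
```

Plotting the fraction against time gives a curve that rises from 1/n to 1, which lets two conditions (for example, priming and control forks) be compared without choosing histogram bins.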


31. Rasnik, I., McKinney, S. A. & Ha, T. Nonblinking and long-lasting single-molecule
fluorescence imaging. Nature Methods 3, 891-893 (2006). 
CORRECTIONS & AMENDMENTS


ERRATUM 


doi:10.1038/nature08600 


SSB protein diffusion on single-stranded 
DNA stimulates RecA filament formation 
Rahul Roy, Alexander G. Kozlov, Timothy M. Lohman & Taekjip Ha 


Nature 461, 1092-1097 (2009) 


In this Article, Figure 3 was printed incorrectly with missing labels. 
The correct figure is shown below. 
Rekindling the gender-bias debate


It is almost five years since Harvard University's
then-president Lawrence Summers made
his controversial remarks about the innate
differences between men’s and women’s skills 
in science and maths. Now, a book is reigniting 
the debate with equally contentious views 
about why female scientists have problems 
advancing in academia. The Science on Women 
in Science (AEI Press; 2009) is a collection of 
nine essays assembled and edited by former 
philosophy professor Christina Hoff Sommers. 
The book's aim, says Sommers, is to convince 
policy-makers and others that gender bias 
may not be the primary cause of the under- 
representation of women in science. 

Six of the nine essays, including Sommers's 
own contribution, contend that innate gender 
differences, not bias, prompt men to be more 
interested than women in science, maths 
and engineering. Published by the American 
Enterprise Institute for Public Policy Research 
(AEI), a right-leaning think tank based in 
Washington DC, the book is Sommers's 
rejoinder to Beyond Bias and Barriers, a 2007 
report by the US National Academy of 
Sciences, the National Academy of Engineering 
and the Institute of Medicine. That study found 
that under-representation of women in high- 
level academic posts is caused largely by bias 
and by an institutional framework that hinders 
women's advancement. 


Gender gap 
In the collection’s first essay, developmental 
psychopathologist Simon Baron-Cohen of 
the University of Cambridge, UK, suggests 
that men are more interested in analysing 
the variables in a system to determine its 
rules — or ‘systemizing’ — whereas women 
are more interested in empathizing. Baron- 
Cohen refers to psychological and behavioural 
studies that show that boys prefer certain toys 
and perform better than girls on some tests, 
and that men dominate occupations such as 
metalworking. These findings provide strong 
evidence, he writes, that men are more drawn 
to systems-based operations than women, 
and that women are more likely to prefer and 
be better at empathy-based activities than 
men. To highlight this, he notes that women's 
conversations involve more discussion about 
feelings compared with those of men. 

But Elizabeth Spelke, a psychologist 
at Harvard University in Cambridge, 
Massachusetts, contends that the gender 
gap is due to societal and historical reasons. 
She cites behavioural, physiological and 
psychological studies suggesting that male 
and female infants are equally interested in a 
variety of objects (in contrast to the traditional 
view that girls are more interested in people 
than in objects); that girls and boys have an 
equal ability to learn the words that represent 
numbers and to count; and that girls can learn 
to read a map as well as boys can. “There is 
no evidence for a male advantage in intrinsic 
aptitude or motivation for mathematics and 
science,” she writes. Spelke concludes that core 
cognitive and motivational patterns in women 
and men are much the same and that they play 
no part in the smaller numbers of women than 
men in science, maths and engineering. 


Contrary to the view of Sommers and others, 
Rosalind Chait Barnett, a senior scientist at the 
Women's Studies Research Center at Brandeis 
University in Waltham, Massachusetts, 
dismisses the notion that women prefer to 
avoid science and engineering because of their 
gender. In their essay, she and her co-author, 
social psychologist Laura Sabattini, argue that 
academia in the United States has historically 
been inhospitable to women scientists. College 
requirements of the mid-nineteenth century 
stipulated that women faculty members 
be single, they point out, and the twentieth 
century's anti-nepotism policies often meant a 
female scientist could not be employed at the 
same university as her husband. “Preferences 
are learned,” says Barnett, noting that today, 
many women deliberately avoid the only 
occupations that employed them years ago. 
“The whole thing is completely culturally 
determined,” she says. 

Other studies have reached similar 
conclusions: Reshma Jagsi at the University 
of Michigan and her colleagues found that 
women are less likely than men to receive 
major funding for early-career clinical 
research (R. Jagsi et al. Ann. Intern. Med. 151, 
804-811; 2009). The authors blame family 
responsibilities and the excessive demands 
of clinical studies that take time away from 
research projects, suggesting that women 
are less successful than men at negotiating to 
reduce their clinical workload. 

Sommers objects to initiatives such as the 
US National Science Foundation’s ADVANCE 
programme, which seeks to increase women's 
representation and advancement in science, 
technology, engineering and maths. She also 
opposes the US government's application of 
‘Title IX’ requirements — which bar gender 
discrimination in any federally funded 
operation — to academic science departments 
because, she believes, it could exclude male 
scientists in favour of females. 


Advancing the debate 
In her essay, ‘Sex, Science and the Economy’, 
she argues that quota-driven and gender- 
balanced academic research is likely to be 
of lower quality than research conducted in 
departments without quotas. To support this, 
Sommers refers to studies suggesting that 
professions are partly biologically determined, 
and includes anecdotal evidence from female 
scientists who have not experienced bias. 
“Before the government rushes in to correct 
the problem, they have to be sure it exists and 
they also have to understand it. I'm trying to 
create a healthy scepticism,” says Sommers. 
“I'm not absolutely convinced there isn’t any 
bias,” she says, but there are “more plausible 
explanations” for academic gender disparity. 

But institutional bias has already been 
shown to be the main obstacle to women's 
ascension in academic science and 
engineering, says Mary Hall Reno, professor 
and chair of the department of physics and 
astronomy at the University of Iowa in Iowa 
City, and outgoing chair of the American 
Physical Society's Committee on the Status 
of Women in Physics. She gives the example 
of how bias was identified and addressed in 
the medical profession: “Forty years ago, the 
thought was that women couldn't be medical 
doctors, and now they have thronged into the 
field,” Reno says. “I thought we were past this. 
We don't need more debate.” 

Phoebe Leboy, president of the Association 
for Women in Science in Washington DC, 
says that the book “cannot be ignored”, noting 
its potential to prompt the overturn of anti- 
discrimination legislation or programmes such 
as ADVANCE if lawmakers were to act on its 
conclusions. But she thinks that scenario is 
unlikely. “This book will preach mostly to the 
converted, and the converted are a relatively 
small group of people with unusually good 
access to the media,” she says. 
Karen Kaplan 
Japanese mentors reap their awards 


The winners of the sixth annual 
Nature Awards for Mentoring 
in Science, announced on 
1 December, have had very 
different career paths. One has 
been doing the rounds at Japan's 
elite academic institutions for 
decades; the other is a slightly 
eccentric product of industrial 
laboratories. But they share 
scientific initiative as well as 
an open-mindedness to their 
students’ ideas and a willingness 
to foster their students’ 
independence. 

The lifetime achievement 
award went to biophysicist 
Fumio Oosawa, who pioneered 
molecular studies on muscle 
contraction and helped invent 
the field of single-molecule 
biology. The midterm career award was won 
by computer scientist Hiroaki Kitano, who 
is renowned for his expertise in robotics and 
systems biology. The prizes, which were 
presented by Philip Campbell, editor-in-chief 
of Nature, at a reception in Tokyo, each carried 
a purse of ¥1.5 million (US$17,000). 

Winning mentors: Hiroaki Kitano (left) and Fumio Oosawa. 

Like all the winners of Nature's mentor 
awards, Oosawa and Kitano share certain 
characteristics, says Campbell. They have 
broad vision, a keen interest in challenging 
students and a willingness to help them 
network to advance their careers. 

Oosawa received his physics degree in 1944 
from the University of Tokyo before moving 
on to Nagoya University and then Osaka 
University. Now 77, he initiated his research on 
muscle contraction in the 1960s by analysing 
the polymerization of actin. Kitano, 48, heads 
Sony Computer Science Laboratories in 
Tokyo. He began his career with information- 
technology company NEC in Tokyo. 


Great support 
Testimony from current and former students, 
who nominated the pair for the awards, reveals 
some of the characteristics shared by superior 
mentors. Kitano and Oosawa both cultivate 
independence in their researchers. Kitano, for 
example, routinely names laboratory members 
as corresponding authors on published papers. 
Both were praised for using their networks of 
colleagues and contacts to introduce their lab 
members to other established researchers in 
the field. “A lot of professors just don't take 
the time to do that,” says Samik Ghosh, a 
researcher at the Systems Biology Institute in 
Tokyo, headed by Kitano. 

Speaking at the awards ceremony, Akiyoshi 
Wada, who headed the judging committee, 
said: “It's obvious that one generation of 
scientists should cultivate the next. But it’s 
too obvious — so obvious that people take 
it for granted.” Oosawa and Kitano did not, 
Wada added. 

Kitano and Oosawa share a broad-minded 
approach to ideas that stimulate their 
students. “He takes an interest in every 
project,” wrote one of Oosawa’s nominators. 
In a written statement, Oosawa said that 
he makes sure to “respect the ideas of my 
students. I never said things like, ‘I already 
thought of that’.” 


Absurd ideas 

One student who nominated Kitano wrote: 
“He is ready to invest wholeheartedly in 
absurd ideas, and in some cases it seemed 
the more absurd the better.” Kitano’s ‘absurd’ 
ideas spawned RoboCup — a seminal 
competition that aims to produce robots, 
currently competing against each other, that 
can take on a human team — and the idea of 
a ‘virtual human’ — a computer model of a 
person, used in drug development (see Nature 
451, 879; 2008). 

Such radical ideas appeal to young Japanese 
researchers. Hiroki Ueda of the Center for 
Developmental Biology, a former student in 
Kitano’s laboratory and one of Japan's most 
prodigious talents, plays down the importance 
of the science knowledge he gleaned from 
working with Kitano. “I learned nothing in his 
laboratory,” he says playfully. But Ueda, who 
considers himself a radical like Kitano, says 
he picked up “the atmosphere”, which helped 
convince him that a career can be founded on 
unconventional ideas. “That's so cool,” Kitano 
said when told of Ueda’s comment. 

In the laboratory, students address Oosawa 
with the more-familiar honorific 
suffix ‘san’ rather than the formal 
‘sensei’. “He had little time for 
authoritarianism,” says one 
student. Instead of an office, 
he had a table at the end of the 
laboratory, but he rarely used it, 
preferring to walk about and talk to 
students. Instead of the Oosawa 
‘lab’, students spoke of a sort of 
nurturing open classroom where 
students could come and go with 
ease. Oosawa says he was careful 
not to push for a debate about 
findings while a student was still 
in the middle of collecting data. 
“I wait,” he says. “I don't judge 
by the speed of the work. Having 
interesting ideas is the most 
important thing.” Breaking down 
barriers made Oosawa an early 
force in equal rights: in the 1960s, at a time 
when few Japanese women could find a career 
in science, a third of his laboratory was female. 


Tough environment 

The accomplishments of Oosawa and Kitano 
are all the more impressive considering their 
research environment. In Japan, a strong 
mentor-protégé culture often promotes 
deference rather than independence. 

There are also systemic issues that can 
hinder the proliferation of great mentors, says 
Hitoshi Murayama, a theoretical physicist 
with positions at the University of Tokyo, the 
Lawrence Berkeley National Laboratory in 
California and the University of California, 
Berkeley. Japanese universities generally do 
not provide stipends to graduate students, so 
many have to work part-time to get through 
school. In Japan, professors can take anyone 
who comes, without necessarily worrying 
about the quality or future of their students. 
By contrast, US professors generally invest 
in young researchers by providing financial 
support from their grant monies, and so 
have more at stake. “Professors don’t take on 
students unless they are serious about it,” says 
Murayama, noting an upside to the US system. 
“This definitely creates a difference in the 
mentor-student relationship.” 

Oosawa and Kitano, though, have managed 
to thrive, and to help their students do 
the same — and both their own and their 
students’ science careers have benefited as 
a result. As Campbell noted at the awards 
ceremony: “Not every outstanding scientist 
is a good mentor, but all outstanding mentors 
are outstanding scientists.” 
David Cyranoski 
See Editorial, page 826. 
Rejuvenation 


The chase is on. 


Julian Tang 


“So, can anyone tell me about the Hoffman 
vortex?” 

Mr Murphy looked expectantly at his 
small A-level physics class as some clusters 
of excited murmuring broke out. 

“Conferring is allowed,” he chuckled. 
“Catherine, Rebecca, come on! Tell me 
what you think before these rowdy boys 
beat you to it!” 

Looking at her and grinning, Rebecca 
nudged Catherine, urging her to speak 
first. 

“Well,” Catherine began hesitantly, as the 
boys looked up, curiously. “Discovered by 
the physicist, Hans Hoffman, it was found 
that when two rings of a specific diameter 
and thickness of the radioactive element 
zylerium are counter-rotated at a specific 
rate about the same axis, at a specific distance 
apart, a vortex of zyleron particles is 
produced between the two rings.” 

Rebecca clapped her on the back, 
congratulating her on her answer. 

“Perfect!” exclaimed Mr Murphy in 
delight. “Now boys, what does this vortex 
do? Come on now, don't let the girls show 
you up!” 

“Well, it was only one girl, sir,” retorted 
Paul, leader of one of the boys’ ‘think tank’ 
clusters. 

“True, I stand corrected,” replied Mr 
Murphy, in mock humility. 

Jonathan, the leader of one of the other 
boy think tanks, raised his right hand. 

“Ah! Jonathan! What can you offer me 
on this fine day?” Mr Murphy looked at 
him, hopefully. 

Jonathan scratched his right ear, a habit 
of his before answering a question in class. 
“The Hoffman vortex created by two rings 
of zylerium, arranged in the way that 
Catherine described, has been postulated 
to reverse the effects of ageing, mainly by 
reversing the effects of solar radiation and 
other ageing processes at a cellular level.” 

“Yes, well done! We're firing on all 
thrusters today, class!” Mr Murphy grinned 
widely and clapped his hands, as Jonathan's 
friends punched him, playfully. 

“OK, now Paul, have you anything else 
to add?” Mr Murphy clasped his hands on 
his stomach and twiddled his thumbs in 
expectation. 

After some brief, last-minute conferring 
with his friends, Paul cleared his throat 
loudly (something that he always did 
before speaking in front of the class). 

“Yes, all of that is fine, but these effects 
have only been shown in microorganisms, 
such as bacteria. The problem is that the 
amount of zylerium we can produce and 
purify at this time is in the order of 
milligrams. To make zylerium rings large 
enough to produce a man-sized Hoffman 
vortex will need a much more efficient 
way to produce zylerium,” he finished 
proudly. 

“Excellent!” said Mr Murphy, bowing 
slightly in Paul’s direction. 

Then another student, Ben, put his hand 
up. He was relatively new, having joined 
the class at the beginning of the current 
school-term. Quietly assertive and intelligent, 
he seemed to be mature beyond 
his teenage years. He had quickly gained 
the respect (and in some cases, awe) of his 
classmates and teachers. 

“Yes, Ben,” nodded Mr Murphy, with 
interest. 

“There is another hypothesis, sir,” he 
started, carefully. “The addition of a third 
zylerium ring creates a modified Hoffman 
vortex, which allows space-time travel, on 
top of the rejuvenation effect?” Ben smiled, 
knowingly, at him. 

All the students looked at him in amazement. 

“Where did you hear that, Ben?” blurted 
out Paul. “Has it been published somewhere, 
yet?” 

Ben shook his head slightly. “Let's just say 
that this was a ‘personal communication’ of 
some sort,” replied Ben, evasively. He was 
still looking at Mr Murphy, intently. 

Mr Murphy returned his look, impassively 
for a moment then glanced around, 
quickly. “OK class, you can all leave a bit 
earlier today. Ben, can you stay behind for 
a minute?” 
Ben nodded, still smiling at him and 
remained seated. 

Bemused, the rest of the students filed out 
in silence, leaving the two of them alone. 

Mr Murphy shut the door then sat on 
the front bench, facing Ben. 

“So, well done Ben, or shall I call you by 
your real name, Jan Bendarian, the bounty 
hunter? The rejuvenation process suits 
you,” Mr Murphy sneered. 

Ben/Jan did not move from his seat. “We 
don't care about you, Murphy, we just want 
the device. What are you planning to do? 
Reduce this world’s population to a bunch 
of babies and let them starve to death at 
your whim?” 

“What? Me? Be so inhumane? Never!” 
Murphy seemed sincere in this. “This 
world has treated me well. It accepted me 
as one of them, even letting me teach their 
young minds. I had planned to settle here 
into my old age, peacefully. Now, you've 
spoiled all that,” he finished, grimly. 

While he was saying this, Murphy had 
casually moved back behind the bench, in 
front of the blackboard. Now, suddenly, he 
hit a hidden switch under the bench-top 
then, waving and laughing at Ben/Jan, he 
walked backwards through the blackboard 
and wall behind him, disappearing. The 
blackboard and wall reformed after him. 

Ben/Jan, still seated, sighed and took 
out a small device from his pocket. Yes, the 
microscopic tracking beacon he had slipped 
into Murphy’s coffee last week was still functioning. 
It would have lodged somewhere 
in his gut, where it would remain, unless 
Murphy had it surgically removed. 

Examining the bench and blackboard, he 
found the switch easily and entered Murphy’s 
inter-dimensional storage area behind. 
Murphy had long gone to some other space- 
time destination, probably already with a 
different age and disguise, with the help of 
the Hoffman vortex generator. 

What the hell, he thought. He had already 
pursued him through five locations, what 
was one more? He always enjoyed a good 
chase. Chuckling to himself, he turned on 
his belt-mounted Hoffman field (a more 
advanced, portable form of the Hoffman 
generator) whereupon his image shimmered 
briefly, then he too disappeared. 
Julian Tang is a clinical/academic 
virologist. This story is dedicated to his 
real ‘A’-level physics class led by the real 
Mr Murphy, who kept us all stimulated 
with his humour and sarcastic wit. 